Unstructured Data Classification Fresco Play MCQs Answers
Disclaimer: These solutions are provided to help and support learners who are stuck on these courses or lack some of the background knowledge. All material and information on this website is for knowledge and educational purposes only.
Try to understand these solutions and use them to solve your Hands-On problems. (Copying and pasting the solutions is not encouraged.)
Course Path: Data Science/MACHINE LEARNING METHODS/Unstructured Data Classification
All questions from the quiz are listed below. For ease, use Ctrl + F to find a question.
Suggestion: If you can't find a question, search by its options for a more accurate result.
Classification Quiz
1. Identify the unstructured data from the following.
- Image
- Data from mySQL DB
- Excel data
Answer: 1)Image
Quiz on Dataset
1. What kind of classification is our case study 'Spam Detection'?
- Multi label
- Binary
- Multi class
Answer: 2)Binary
Quiz on Pre-processing
1. Which pre-processing technique is used to remove the most commonly used words?
- Stopword removal
- Lemmatization
- Tokenization
Answer: 1)Stopword removal
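A minimal sketch of stop word removal follows. The stop word list here is a small hand-picked set assumed for illustration; real projects usually take a longer list from an NLP library.

```python
# Tiny hand-picked stop word list (an assumption for illustration only).
STOP_WORDS = {"the", "it", "is", "a", "an", "and", "of", "to", "in"}

def remove_stop_words(text):
    """Lowercase the text, split on whitespace, and drop stop words."""
    tokens = text.lower().split()
    return [tok for tok in tokens if tok not in STOP_WORDS]

filtered = remove_stop_words("The computer is fast and it is cheap")
print(filtered)  # ['computer', 'fast', 'cheap']
```

Only the content-bearing words survive, which is why this step typically comes before feature extraction.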
Quiz on Cross validation
1. The cross-validation technique is used to evaluate a classifier by dividing the data set into a training set to train the classifier and a testing set to test the same.
- False
- True
Answer: 2)True
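To make the train/test division concrete, here is a rough sketch of k-fold cross-validation, one common variant in which the split is repeated so that every sample lands in the test set exactly once. The helper name `k_fold_indices` is hypothetical and the folds are simple contiguous blocks with no shuffling.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for each of k contiguous folds."""
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n_samples) if i < start or i >= start + size]
        yield train, test
        start += size

folds = list(k_fold_indices(n_samples=10, k=5))
for train, test in folds:
    print("train:", train, "test:", test)
```

A classifier would be trained on each `train` part and scored on the matching `test` part, and the scores averaged.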
Quiz on Performance Evaluation Measures
1. True Negative is when the predicted instance and the actual instance are positive.
- True
- False
Answer: 2)False
2. True Positive is when the predicted instance and the actual instance are not negative.
- False
- True
Answer: 2)True
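The two definitions above can be checked with a short sketch that counts the four confusion-matrix cells. The labels and predictions are made-up toy data, and `confusion_counts` is a hypothetical helper, not a library function.

```python
def confusion_counts(actual, predicted, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = tn = fp = fn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1  # predicted positive, actually positive
        elif p != positive and a != positive:
            tn += 1  # predicted negative, actually negative
        elif p == positive:
            fp += 1  # predicted positive, actually negative
        else:
            fn += 1  # predicted negative, actually positive
    return {"TP": tp, "TN": tn, "FP": fp, "FN": fn}

actual = [1, 0, 1, 1, 0, 0]
predicted = [1, 0, 0, 1, 1, 0]
counts = confusion_counts(actual, predicted)
print(counts)  # {'TP': 2, 'TN': 2, 'FP': 1, 'FN': 1}
```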
Quiz-Final Assessment
1. The following are all classification techniques, except ___________
- SGDClassifier
- SVM
- StratifiedShuffleSplit
- Random Forest
Answer: 3)StratifiedShuffleSplit
2. The classification where each data is mapped to more than one class is called ___________
- Binary Classification
- Multi Label Classification
- Multi Class Classification
Answer: 2)Multi Label Classification
3. The following are pre-processing methods used for unstructured data classification, except _________
- Stop word removal
- Lemmatization
- Confusion_matrix
- Stemming
Answer: 3)Confusion_matrix
4. In a Document Term Matrix (DTM), each row represents ___________
- word
- TF value
- document
Answer: 3)document
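As a small illustration of the row/column layout, the sketch below builds a document-term matrix by hand from three made-up documents. Each row is one document and each column is one vocabulary term; the cell holds the raw term count.

```python
# Toy corpus (made up for illustration).
docs = ["free prize now", "meeting at noon", "free meeting"]

# Columns: the sorted vocabulary across all documents.
vocab = sorted({word for doc in docs for word in doc.split()})

# Rows: one per document, holding the count of each vocabulary term.
dtm = [[doc.split().count(term) for term in vocab] for doc in docs]

print(vocab)  # ['at', 'free', 'meeting', 'noon', 'now', 'prize']
for row in dtm:
    print(row)
```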
5. Imagine you have just finished training a decision tree for spam classification, and it is showing abnormally bad performance on both your training and test sets. Assume that your implementation has no bugs. What could be the reason for this problem?
- Your decision trees are too shallow
- You need to increase the learning rate
- You are overfitting
- All the options
Answer: 1)Your decision trees are too shallow
6. Identify the stop word(s) from the following.
- Both "the" and "it"
- "fragment"
- "the"
- "computer"
- "it"
Answer: 1)Both "the" and "it"
7. a) Download the dataset from https://hrcdn.net/s3_pub/istreet-assets/H4_TQkbOj39HUNoBukluIQ/training.txt and load it to the variable 'sentiment_analysis_data'.
b) Give the column names as 'label' and 'message'.
c) Try out the code snippets and answer the questions.
What does the command sentiment_analysis_data['label'].value_counts() return?
- The number of rows in the dataset
- The total count of elements in the 'label' column
- The count of unique values in the 'label' column
- The number of columns in the dataset
Answer: 3)The count of unique values in the 'label' column
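A quick way to see what `value_counts()` reports is to run it on a tiny frame. The sketch below uses a made-up five-row DataFrame (not the course dataset) with the same column names, and assumes pandas is installed.

```python
import pandas as pd

# Toy stand-in for the course data (values are made up).
toy = pd.DataFrame({
    "label": [1, 0, 1, 1, 0],
    "message": ["great", "awful", "loved it", "nice", "boring"],
})

# One entry per distinct label, holding how many rows carry that label.
counts = toy["label"].value_counts()
print(counts)

# For contrast: nunique() gives only the number of distinct labels.
n_distinct = toy["label"].nunique()
print(n_distinct)  # 2
```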
8. a) Download the dataset from https://hrcdn.net/s3_pub/istreet-assets/H4_TQkbOj39HUNoBukluIQ/training.txt and load it to the variable 'sentiment_analysis_data'.
b) Give the column names as 'label' and 'message'.
c) Try out the code snippets and answer the questions.
Which of the following commands is used to view the dataset SIZE, and what is the value returned?
- sentiment_analysis_data.shape, (6918, 2)
- sentiment_analysis_data.shape, (6918, 3)
- sentiment_analysis_data.size, (6918, 3)
- sentiment_analysis_data.size(), (6918, 2)
Answer: 1)sentiment_analysis_data.shape, (6918, 2)
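The difference between `shape` and `size` is easy to check on a small frame. This sketch again uses made-up data rather than the course dataset and assumes pandas is installed.

```python
import pandas as pd

# Toy frame with 3 rows and 2 columns (values are made up).
toy = pd.DataFrame({
    "label": [1, 0, 1],
    "message": ["great", "awful", "nice"],
})

shape = toy.shape  # (rows, columns) as a tuple
size = toy.size    # total number of cells, rows * columns

print(shape)  # (3, 2)
print(size)   # 6
```

Both are attributes rather than methods, so they are accessed without parentheses.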
9. a) Download the dataset from https://hrcdn.net/s3_pub/istreet-assets/H4_TQkbOj39HUNoBukluIQ/training.txt and load it to the variable 'sentiment_analysis_data'.
b) Give the column names as 'label' and 'message'.
c) Try out the code snippets and answer the questions.
What is the output of the following command: print(sentiment_analysis_data['label'].unique())
- [true false]
- [1 0]
- None of the options
- [yes no]
Answer: 2)[1 0]
10. Choose the correct sequence for classifier building from the following.
- Initialize -> Train -> Predict -> Evaluate
- Train -> Test -> Initialize -> Predict
- Initialize -> Evaluate -> Train -> Predict
- None of the options
Answer: 1)Initialize -> Train -> Predict -> Evaluate
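The four steps can be sketched with a deliberately trivial classifier. `MajorityClassifier` is a hypothetical toy baseline written for this example (it ignores the features entirely), and the messages and labels are made up; a real model would follow the same order of calls.

```python
from collections import Counter

class MajorityClassifier:
    """Toy baseline that always predicts the most frequent training label."""

    def __init__(self):
        self.majority_ = None

    def fit(self, X, y):
        # "Training" here just records the most common label.
        self.majority_ = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.majority_ for _ in X]

X_train = ["win cash", "hello", "see you", "lunch?"]
y_train = [1, 0, 0, 0]
X_test = ["free cash", "ok"]
y_test = [1, 0]

clf = MajorityClassifier()    # 1. Initialize
clf.fit(X_train, y_train)     # 2. Train
y_pred = clf.predict(X_test)  # 3. Predict
accuracy = sum(p == t for p, t in zip(y_pred, y_test)) / len(y_test)  # 4. Evaluate
print(y_pred, accuracy)  # [0, 0] 0.5
```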
11. a) Download the dataset from https://hrcdn.net/s3_pub/istreet-assets/H4_TQkbOj39HUNoBukluIQ/training.txt and load it to the variable 'sentiment_analysis_data'.
b) Give the column names as 'label' and 'message'.
c) Try out the code snippets and answer the questions.
Is there a class imbalance problem in the given data set?
- Yes
- No
Answer: 2)No
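One simple way to judge class imbalance is to compare the share of rows per label. The sketch below does this on a made-up label list, not the course dataset; `class_shares` is a hypothetical helper, and what counts as "imbalanced" is a judgment call rather than a fixed threshold.

```python
from collections import Counter

def class_shares(labels):
    """Return the fraction of samples belonging to each label."""
    counts = Counter(labels)
    total = len(labels)
    return {label: count / total for label, count in counts.items()}

# Made-up labels: five positives and three negatives.
toy_labels = [1, 0, 1, 0, 1, 1, 0, 1]
shares = class_shares(toy_labels)
print(shares)  # {1: 0.625, 0: 0.375}
```

Shares that are roughly comparable suggest no serious imbalance; a split such as 0.99 versus 0.01 would.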
12. a) Download the dataset from https://hrcdn.net/s3_pub/istreet-assets/H4_TQkbOj39HUNoBukluIQ/training.txt and load it to the variable 'sentiment_analysis_data'.
b) Give the column names as 'label' and 'message'.
c) Try out the code snippets and answer the questions.
What kind of classification is the given case study (Sentiment Analysis dataset)?
- Binary classification
- Multi label classification
- Multi class classification
Answer: 1)Binary classification
13. Choose the correct sequence from the following.
- Pre-Processing -> Predict -> Train
- Data Analysis -> Pre-Processing -> Model Building -> Predict
- Data Analysis -> Pre-Processing -> Predict -> Train
- Pre-Processing -> Model Building -> Predict
Answer: 2)Data Analysis -> Pre-Processing -> Model Building -> Predict
14. Inverse Document frequency is used in the term-document matrix.
- False
- True
Answer: 1)False
15. Pruning is a technique associated with __________
- SVM
- Decision tree
- Logistic regression
- Linear regression
Answer: 2)Decision tree
16. Which of the given hyperparameters, when increased, may cause the random forest to overfit the data?
- Number of Trees
- Depth of Tree
- Learning Rate
Answer: 2)Depth of Tree
17. The higher value of which of the following hyperparameters is better for the decision tree algorithm?
- Cannot say
- Samples for leaf
- Depth of tree
- Number of samples used for split
Answer: 1)Cannot say
18. TF-IDF is a feature extraction technique.
- False
- True
Answer: 2)True
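For a feel of how TF-IDF turns text into numeric features, here is a minimal sketch of one common textbook variant: term frequency times the natural log of (number of documents / documents containing the term). Libraries often use smoothed or normalized versions, so their numbers will differ. The corpus is made up, and the helper assumes the term occurs in at least one document.

```python
import math

# Toy tokenized corpus (made up for illustration).
docs = [
    ["free", "prize", "now"],
    ["meeting", "at", "noon"],
    ["free", "meeting"],
]

def tf_idf(term, doc, corpus):
    """Unsmoothed TF-IDF; assumes the term occurs somewhere in the corpus."""
    tf = doc.count(term) / len(doc)                # share of the document
    df = sum(1 for d in corpus if term in d)       # documents containing term
    idf = math.log(len(corpus) / df)               # rarer terms score higher
    return tf * idf

prize_score = tf_idf("prize", docs[0], docs)  # appears in 1 of 3 documents
free_score = tf_idf("free", docs[0], docs)    # appears in 2 of 3 documents
print(round(prize_score, 4), round(free_score, 4))
```

"prize" scores higher than "free" in the first document because it is rarer across the corpus.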
19. What is the purpose of lemmatization?
- To convert words into a proper base form
- To split into sentences
- To remove redundant words
- To convert a sentence into words
Answer: 1)To convert words into a proper base form
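The idea of mapping words to a base form can be sketched with a lookup table. This is only a toy: the `LEMMAS` dictionary is a tiny hand-written assumption, whereas real lemmatizers rely on a full vocabulary and usually the word's part of speech.

```python
# Hand-written lookup table (an assumption for illustration only).
LEMMAS = {
    "running": "run",
    "ran": "run",
    "mice": "mouse",
    "was": "be",
    "better": "good",
}

def lemmatize(token):
    """Return the base form if known, otherwise the lowercased token."""
    token = token.lower()
    return LEMMAS.get(token, token)

lemmas = [lemmatize(word) for word in ["Mice", "ran", "computer"]]
print(lemmas)  # ['mouse', 'run', 'computer']
```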
20. Supervised learning differs from unsupervised learning as supervised learning requires __________
- Unlabeled data
- Labeled data
- None of the options
- Raw data
Answer: 2)Labeled data