Data Science Landscape Quiz
As a data Journalist, which of the following tasks are most germane to your role?
Communication
... [Show More] skills
Brainpower
Read More
Previous
Play
Next
Rewind 10 seconds
Move forward 10 seconds
Unmute
0:07
/
0:15
Full screen
Which of the following is one of the most fundamental characteristics of a data scientist?
Having a sense of curiosity about all things
Which of the following are examples of unstructured data? Select all that applies.
Facebook images
Twitter feeds
The Venn diagram that depicts the intersection of Science, Technology and Data has highlighted a cross section known as the 'danger zone.' Which of the following is an accurate depiction of this overlap in the Venn diagram?
Has technology and data experience but no science (analytics) background.
Data Science Methodology Quiz
The eight data science methodology approaches can be viewed as two larger groupings, the second grouping comprises: train, validate, deploy models and the feedback environment. How is this second grouping different in overall approach from the first grouping (business understanding, exploration, transformation and visualization of data)?
The second grouping addresses predictive and prescriptive analytics, whereas the first grouping addresses descriptive analytics.
Which of the following is a true statement?
Data scientists transform data into knowledge to solve business problems.
Data journalists capture domain knowledge for successful business alignment.
Data engineer architect how data is organized and ensure operability.
All of the above are true
Business understanding is the first part of your analytics journey. Which of the following come to mind when you are planning your business approach? Select one or more.
Perform demand planning and supply chain optimization for your offerings across different segments
Reduce costs
If you had to choose one overarching difference between these methodologies in Question 19, which of the following would best depict that difference in approach?
Unlike KDD and SEMMA, CRIPS-DM considers business understanding.
Descriptive tables share which of the following characteristics?
Measures of Central Tendency
Measures of Dispersion
Measures of Distribution
All of the above answers are correct
The data science methodology includes the following stages: (fill in the missing stage) business understanding, data exploration and preparation, data representation and transformation, ________________, validate data models, ______________, and environment feedback.
Train data models, deploy data models
Data Science on the Cloud Quiz
Which of the following is an example of open source visualization and plotting tool or tools?
Matplotlib
Pixiedust
OpenCV
All of the above are correct.
The Profile view, under the Refinery tab of Watson Studio is designed to present you with which of the following pieces of information?
Frequency and statistics
When working with Data Refinery in Watson Studio, you are presented with three tabs: Data, Profile and Visualization. What is the purpose of the Profile view?
In the Profile view, the user can validate the data to see if any features may need further Data Refinery.
The Communities tab of Watson Studio provides which of the following artifacts?
Tutorials
Data Sets
Articles
All of the above are correct.
There are many ideas as to why some data scientists prefer Python over RStudio. Which of the following seems to be the prevailing argument that favors Python over R?
Python is a more generalized language versus R which is more statistics focused.
When using Jupyter Notebooks, inevitably, you will need to import libraries such as NumPy and SciPy. Which of the following integration layers best describes this kind of an activity?
Scientific computing and statistics packages
Explore and Prepare Data Quiz
Hadley Wickham is known for saying "Tidy datasets are all alike, but every messy dataset is messy in its own way." Which of the following statements supports this assertion? Select all that apply.
Avoid redundancy, logical errors, or issues with updates.
Complement programming languages' ability to perform vectorized operations.
Ensure Boolean values are encoded appropriately.
When transforming messy data to tidy data, which of the following is a good practice?
Multiple variables are stored in one column.
Variables are stored in both rows and columns.
Multiple types of observational units are stored in the same table.
All of the above are correct.
Data scientist and data engineers often access RDBMS databases to retrieve data. Which of the following specific tasks is an example of such tasks?
Data scientists access the data via SQL or language-specific libraries.
Data engineers perform a task called ETL (Extract, Transform, Load) where they take data from one source and move it to another.
Use of NoSQL, since it is best for high latency and JSON based storage
All of the above are correct.
You can flag missing observations using machine learning (ML) model. Not all models address missing data equally. Which of the following statements is true regarding using ML models to flag missing data?
Regression models handle summary statistics better.
Tree based models handle outliers better.
Represent and Transform Data Quiz
...
With ____________ data, you have categorical variables that can be described by groups rather than numbers.
Structured
When would you use a histogram?
To understand the distribution of a variable
When would you use a bar chart?
When I want to explore in time
When I have categorized data [Show Less]