Top 25 Data Analytics Interview Questions and Answers - PowerPoint PPT Presentation

About This Presentation
Title:

Top 25 Data Analytics Interview Questions and Answers

Description:

Get prepared before stepping into an interview with this presentation. For more- – PowerPoint PPT presentation

Number of Views:385

less

Transcript and Presenter's Notes

Title: Top 25 Data Analytics Interview Questions and Answers


1
(No Transcript)
2
A Data Analytics deciphers information and
transforms it into data that can offer approaches
to improve a business, in this way influencing
business choices. It accumulates data from
different sources and deciphers examples and
patterns as such a Data Analyst set of working
responsibilities should feature the explanatory
idea of the job.
3
Key Reasons to Become a Data Analytics
4
  • Profoundly popular field
  • Generously compensated and Diverse Roles
  • Advancing working environment situations
  • Improving item guidelines
  • Helping the world
  • Data analytics calling is the most requested in
    2020. It will increment furthermore and most top
    organizations are recruiting like Amazon,
    Facebook, Google, Intel, and Apple, and so forth
    and also one of the quickest developing areas and
    furthermore high paid occupations contributions.
  • This explains dreary errands what human sets
    aside on more effort to take on basic reasoning
    and critical thinking aptitudes. The use of AI
    has empowered organizations to tweak their
    contributions and upgrade client encounters.
    Prescient examination and AI have upset the
    medicinal services industry. It is sparing lives
    by empowering early discovery of tumors, organ
    inconsistencies, and then some.

5
NECESSITIES AS AN INFORMATION EXAMINER OR
BUSINESS INFORMATION INVESTIGATOR
6
  • Technical skill with respect to information
    models, database plan improvement, information
    mining, and division strategies
  • Strong information and involvement in revealing
    bundles (Business Objects and so on), databases
    (SQL and so forth), programming (XML, JavaScript,
    or ETL structures)
  • Knowledge of insights and experience utilizing
    factual bundles for examining datasets (Excel,
    SPSS, SAS and so forth)
  • Strong expository aptitudes with the capacity to
    gather, arrange, investigate, and disperse huge
    measures of data with meticulousness and
    precision
  • Adept at questions, report composing and
    introducing discoveries
  • BS in Information Management or Statistics,
    Mathematics, Economics, Computer Science, etc.,

7
DATA ANALYTICS SALARY IN INDIA
New data expert (1 to 4 years) of
experience  4lakhs/annum
Mid carrier expert (5 to 9 years) of
experience  6 to 7lakhs/annum
Senior or developed bearer (5 to 9 years) of
experience  gt10lakhs/annum
8
THE BEST 25 DATA ANALYTICS INTERVIEW QUESTIONS
AND ANSWERS.
9
(No Transcript)
10
Information cleaning likewise alluded as
information purging, manages to distinguish and
expelling blunders and irregularities from
information so as to upgrade the nature of the
information.
2. WHAT IS INFORMATION CLEANING?
11
(No Transcript)
12
(No Transcript)
13
Information examiners can tailor their work and
answer to fit the situation. For example, if a
maker is tormented with delays and impromptu
stoppages, a demonstrative examination approach
could help recognize what precisely is causing
these deferrals. From that point, different types
of examination can be utilized for fixing these
issues.
  • Data analysts
  • Data analyst have moderate math, factual and
    coding abilities
  • Have a solid business sharpness
  • Develop key execution pointers
  • Create representations of the information
  • Utilize business insight and investigation
    apparatuses
  • Data Scientist
  • Data scientists have solid math and factual
    abilities.
  • Have solid coding abilities and business ideas
  • Identify patterns with AI
  • Make forecasts dependent on information patterns
  • Write code to aid information examination

14
(No Transcript)
15
6. WHAT IS REQUIRED TO TURN INTO AN INFORMATION
EXAMINER?
  • To turn into an information investigator,
  • Robust information on announcing bundles
    (Business Objects), programming language (XML,
    JavaScript, or ETL systems), databases (SQL,
    SQLite, and so forth.)
  • Strong aptitudes with the capacity to
    investigate, sort out, gather and scatter
    enormous information with exactness
  • Technical information in database plan,
    information models, information mining and
    division procedures
  • Strong information on measurable bundles for
    investigating huge datasets (SAS, Excel, SPSS,
    and so forth).

16
  • The following valuable tools for data analytics
  • Rapid Miner
  • Open Refine
  • KNIME
  • Google Search Operators
  • Solver
  • Node XL
  • Wolfram Alphas
  • Google Fusion tables
  • Splunk
  • R Programming
  • Python
  • Tableau

7. RUNDOWN OF SOME BEST INSTRUMENTS THAT CAN BE
VALUABLE FOR DATA ANALYTICS?
17
8. FOR WHAT REASON WOULD YOU LIKE TO BE DATA
INVESTIGATOR?
Generally, this kind of inquiry can fill in as an
icebreaker. Notwithstanding, now and then,
regardless of whether the questioners dont
unequivocally say it, they anticipate that you
should answer an increasingly explicit With
these self-reflective questions, theres not
really a right answer I can offer you. There are
wrong answers, thoughred flags for which the
employer is searching.
  • A few things you probably want to get across
    include
  • You love data.
  • Youve researched the company and understand why
    your role as a data analyst will help it succeed.
  • You more or less understand whats expected of
    your role.
  • Youre confident in your decision.

18
(No Transcript)
19
10. DESCRIBE A TIME WHEN YOU HAD TO PERSUADE
OTHERS. HOW DID YOU GET BUY-IN?
The trick to this question is to demonstrate that
you not only persuaded others of a decision but
that it was the right decision. Sample answer
As a data analyst intern at my last company, we
didnt really have a modern means of transferring
files between co-workers. We used flash drives.
It took some work, but eventually, I convinced my
manager to let me research file-sharing services
that would work best for our team. We tried drop
box and Google Drive, but ultimately we settled
on using Share point drives because it integrated
well with some of the software we were already
using on a daily basis, especially Excel. It
certainly improved efficiency and minimized the
wasted time searching for who had what records at
what times.
20
(No Transcript)
21
(No Transcript)
22
(No Transcript)
23
14. WHAT IS THE DIFFERENCE BETWEEN DATA MINING
AND DATA PROFILING?
Data mining is a process in which you classify
patterns, irregularities, and correlations in
large data sets to forecast outcomes. On the
other hand, data profiling lets analysts observed
and erase data. Sample answer Whereas data
mining is concerned with gathering information
from data, data profiling is concerned mainly
with estimating the quality of data.
24
15. How have you dealt with messy data in the
past? Up to 80 of a data analysts time can be
spent on cleaning data. Even more important when
you consider that, if your data is unclean and
produces inaccurate insights, it could lead to
costly company actions based on false
information. Yikes. That could mean trouble for
you. You want to validate not only that you know
the difference between messy data and clean data
but also that you used that information to
cleanse the data.
25
Sample answer A client of ours was unhappy with
our staffing reports, so I needed to pore over
one to see what was causing their chagrin. I was
looking at some data in a spreadsheet that
contained information about when our call canter
employees went to break, took lunch, etc., and I
noticed that the time stamps were inconsistent
some had a.m., some had p.m., some didnt have
any specifications for morning or night, and
worst of all, many of these employees were
located in different time zones, so this needed
to be made more consistent as well. To solve the
a.m. or p.m. dilemma, I made sure all times were
specified in the military. This had two benefits
first, it eliminated the strings in the data and
made the whole column numeric second, it removed
any need to specify morning or night as military
time does this inherently. Next, I converted all
times to UTC, this way all of the data was in the
same time zone. This was important for the report
I was working on because otherwise the data would
be presented out of order and it could cause
confusion for our client. Reorganizing the
reports data this way helped improve our
relationship with the client, who, due to the
time discrepancies, previously believed we were
understaffed at specific times of day.
26
16. HOW MANY X IS IN Y PLACE?
  • This question takes many forms, but the premise
    of it is quite simple. Its asking you to work
    through a mathematical problem, usually figuring
    out the number of an item in a certain place, or
    figuring out how much of something could
    potentially be sold somewhere.
  • Find how many malls are in a particular city in
    the country?
  • Find how many engineering colleges with adequate
    facilities are available in state/district?

27
Sample answer I believe there are about 10
million people in New York, give or take a couple
million. Assuming each of them lives in a
residential building, with three rooms or more,
if there were one window per room that would make
approximately 30 million windows. Im making a
few different assumptions that are probably
inaccurate. For instance, that everyone lives
alone and that the average size of their
residences is just three rooms with one window
per room. Obviously, there will be a lot of
variations in reality. But I consider, in terms
of residences, 30 million windows could be
close. Then youd have to take windows for
businesses, subway rail cars, and personal
vehicles. If the average subway car seats 1,000
people, with 1 window per 2 seats, thats 500
windows per car. A little more math Id guess
there are at least enough subway cars to support
the whole population of New York so 10 million
divided by 1,000 comes out to 10,000. So there
are another 5 million windows for subway cars. If
half of all people own their own vehicle, thats
another six windows per person, so 30 million
more windows. Id guess there are at least
100,000 businesses with windows in NYC. Lets
just say for the sake of argument theres an
average of 10 windows each. Thats another
million. Im sure theres way more than
that. Overall, were at 66 million windows
(30,000,000 x 2 5,000,000 1,000,000). All of
this pretty much hinges on how close I am to the
actual population of New York City. Also, there
are other places to find windows, such as busses
or boats. But thats a start.
28
(No Transcript)
29
18. WHAT WOULD BE YOUR TOP INTERVIEW QUESTION FOR
PROSPECTIVE DATA ANALYSTS? HOW WOULD YOU ANSWER
THIS QUESTION?
  • Worldwide valuation Perform comparative
    examination of the raw file and the loaded data
    by the following
  • Count the number of rows
  • Count the number of columns
  • Sum the numeric columns
  • Check the data types (i.e., if I thought that a
    column was entirely filled with dates then that
    should persist)
  • Localized assessment
  • Randomly pick a few rows and manually compare
  • Check the distinct elements in textual fields
    (i.e., if categories A, B, and C exist before,
    then thats all I should see after)
  • Check conversions if applicable (i.e., if NA is
    used for non-responses for numerical values then
    the database wont accept it if were storing the
    data in a numerical field)

Suppose that you were provided a flat-file (
Excel, CSV, etc. 41 to manipulate and load
into a database. It contains millions of rows.
Suppose that you were provided a flat-file (
Excel, CSV, etc. 41 to manipulate and load
into a database. It contains millions of
rows. While loading the database from data, you
have to perform an analysis, in case building
some type of mathematical model. While you cant
ever be 100 confident that everything was
processed and loaded correctly, you can do some
things in order to ensure that you are reasonably
confident. Describe for me what you would do.
30
(No Transcript)
31
  • Data Exploration
  • It defines exploring the data for analysis. When
    a data analyst has identified the business
    problem, it is suggested to go through the data
    provided by the client and then analyze the root
    cause of the problem.
  • Data Preparation
  • Data is collected from the client or any other
    sources are usually in the raw form. It plays an
    important role in the process of data analysis as
    it detects the missing values and outliers or any
    other data anomalies and treats accordingly to
    model the data.
  • Data Modelling
  • Once the data is prepared, the process of data
    modeling starts where the model is run repeatedly
    for improvements. It ensures that the best
    possible result is provided.
  • Validation
  • In the process of validation, the model developed
    by data analysts and the model provided by the
    client is validated against each other to find
    out if the developed model will meet the business
    requirements.
  • Deployment of the Model and Tracking
  • This is the final step where the model is
    deployed and is tested for efficiency and
    accuracy.

32
(No Transcript)
33
(No Transcript)
34
22. WHAT ARE THE DIFFERENT TYPES OF HYPOTHESIS
TESTING IN DATA ANALYTICS?
  • T-test It is used for the typical deviation is
    unidentified and the sample size is moderately
    small.
  • Chi-Square Test for Independence These tests are
    used to discover the significance of the
    association between categorical variables in the
    people sample.
  • Homogeneity of Variance (HOV) tests the
    similarity of dispersion parameters in several
    population samples.
  • Analysis of Variance (ANOVA) This kind of
    hypothesis testing is used to analyze differences
    between the means in a variety of groups. This
    test is frequently used similarly to a T-test
    but, is used for a lot more than two groups.
  • Welchs T-test This test is used to discover the
    test for equality of means between two population
    samples

35
  • A data analyst must have the following skills
  • Database knowledge
  • Database management
  • Data Blending
  • Querying
  • Data manipulation
  • Predictive Analytics
  • Basic descriptive statistics
  • Predictive modeling
  • Advanced analytics
  • Big Data Knowledge
  • Big data analytics
  • Unstructured data analysis
  • Machine learning
  • Presentation skill
  • Data visualization
  • Insight presentation
  • Report design

23. MENTION THE KEY SKILLS REQUIRED FOR DATA
ANALYTICS.
36
24. DESCRIBE UNIVARIATE, BIVARIATE, AND
MULTIVARIATE ANALYSIS IN DATA ANALYTICS.
  • Uni-variate analysis
  • Its only 1 variable and therefore you will find
    no relationships, causes. The key facet of the
    univariate analysis would be to summarize the
    information and discover the patterns within it
    to produce actionable decisions.
  • Bi-variate analysis
  • This deals with the partnership between two sets
    of data. These sets of paired data result from
    related sources or samples. A few of the tools
    used to analyze such data includes chi-squared
    tests and t-tests once the data have a
    correlation. The potency of the correlation
    between both data sets will soon be tested in
    bivariate analysis.
  • Multivariate analysis 
  • This is similar to bivariate analysis. It is a
    couple of techniques useful for the analysis of
    data sets that contain more than one variable,
    and the techniques are especially valuable
    whenever using correlated variables.

37
25. WHAT IS THE DIFFERENCE BETWEEN LINEAR AND
LOGISTIC REGRESSION?
Linear regression is a statistical model that
attempts to fit the best possible straight line
between the independent and the dependent
variables when a set of input features are given.
As the output is continuous, the cost function
measures the distance from the observed to the
predicted values. It can be said to be an
appropriate choice to solve regression problems,
for example, predicting sales numbers. On
another hand, Logistic regression gives
probability as its output. By definition, it is a
bounded variable between zero and one, because of
the sigmoid activation function. It is
appropriate to solve classification problems, for
instance, predicting whether a transaction is a
fraud or not.
38
(No Transcript)
39
Website - https//www.henryharvin.com Phone- 91
- 9015266266 Mail - info_at_henryharvin.com
Write a Comment
User Comments (0)
About PowerShow.com