# 50 Common Data Analyst Interview Questions
### SQL & Database Questions
1. What are the different types of SQL joins?
2. How do you find duplicate records in a table?
3. Write an SQL query to get the second highest salary from an Employee table.
4. What is the difference between WHERE and HAVING clauses?
5. How do you optimize a slow SQL query?
6. What is the difference between UNION and UNION ALL?
7. What are window functions in SQL? Can you give an example?
8. What is the difference between a primary key and a foreign key?
9. How do you rank rows in SQL?
10. What are the advantages of using indexes in SQL?
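
Questions 2 and 3 can be sketched quickly. The example below uses Python's built-in `sqlite3` module with an in-memory database; the `Employee` columns (`id`, `name`, `salary`) and the sample rows are assumptions made up for illustration, and other database engines may need slightly different syntax.

```python
import sqlite3

# In-memory database with a hypothetical Employee table (illustrative data only).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Employee (id INTEGER PRIMARY KEY, name TEXT, salary INTEGER)"
)
conn.executemany(
    "INSERT INTO Employee (name, salary) VALUES (?, ?)",
    [("Ana", 90000), ("Ben", 75000), ("Cara", 90000), ("Ben", 75000), ("Dev", 60000)],
)

# Question 2: group on the columns that define a duplicate and keep
# only the groups that occur more than once.
duplicates = conn.execute(
    """
    SELECT name, salary, COUNT(*) AS occurrences
    FROM Employee
    GROUP BY name, salary
    HAVING COUNT(*) > 1
    """
).fetchall()

# Question 3: the second highest salary is the largest salary that is
# strictly below the overall maximum (ties at the top are ignored).
second_highest = conn.execute(
    """
    SELECT MAX(salary)
    FROM Employee
    WHERE salary < (SELECT MAX(salary) FROM Employee)
    """
).fetchone()[0]

print(duplicates)       # [('Ben', 75000, 2)]
print(second_highest)   # 75000
```

The subquery form is only one possible answer; a window function such as `DENSE_RANK()` is a common alternative where the engine supports it.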
### Data Analysis & Statistics
11. Explain the difference between correlation and causation.
12. What is the central limit theorem?
13. How do you handle missing data in a dataset?
14. What is the difference between mean, median, and mode?
15. What is a p-value in hypothesis testing?
16. Explain standard deviation and variance.
17. What is the difference between Type I and Type II errors?
18. What is A/B testing and how do you implement it?
19. What is the difference between a population and a sample in statistics?
20. Explain the concept of outliers and how to handle them.
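
For questions 14, 16, and 20, a small numeric sketch can help when practicing an answer. It uses only Python's standard `statistics` module; the sample values are made up for illustration.

```python
import statistics

# Illustrative sample (not real data).
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(data)      # arithmetic average
median = statistics.median(data)  # middle value of the sorted data
mode = statistics.mode(data)      # most frequent value

# Population variance is the average squared deviation from the mean;
# the standard deviation is its square root.
pop_variance = statistics.pvariance(data)
pop_std = statistics.pstdev(data)

# Sample variance divides by n - 1 instead of n.
sample_variance = statistics.variance(data)

# Adding one extreme value moves the mean far more than the median,
# which is why the median is often preferred when outliers are present.
with_outlier = data + [100]
mean_with_outlier = statistics.mean(with_outlier)
median_with_outlier = statistics.median(with_outlier)

print(mean, median, mode)            # 5 4.5 4
print(pop_variance, pop_std)         # 4 2.0
print(median_with_outlier)           # 5
```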
### Python for Data Analysis
21. How do you read a large CSV file efficiently in Python?
22. What is the difference between a list and a tuple in Python?
23. What are lambda functions?
24. How do you remove duplicates from a Pandas DataFrame?
25. What is the difference between loc[] and iloc[] in Pandas?
26. How do you merge two DataFrames in Pandas?
27. What is the difference between apply() and map() functions in Pandas?
28. How do you handle missing values in Pandas?
29. What are NumPy arrays and how are they different from lists?
30. Explain the difference between Series and DataFrame in Pandas.
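
Questions 22 and 23 can be illustrated with plain Python. This is a minimal sketch; the variable names and sample values are hypothetical.

```python
# Question 22: lists are mutable, tuples are not.
scores_list = [3, 1, 2]
scores_tuple = (3, 1, 2)

scores_list.append(4)  # allowed: the list grows in place

try:
    scores_tuple[0] = 10  # tuples do not support item assignment
    tuple_mutated = True
except TypeError:
    tuple_mutated = False

# Because tuples of hashable items are themselves hashable,
# they can serve as dictionary keys; lists cannot.
visits = {("NYC", 2024): 120}

# Question 23: a lambda is a small anonymous function, often passed
# as the key argument when sorting.
rows = [("Ana", 90000), ("Dev", 60000), ("Ben", 75000)]
by_salary = sorted(rows, key=lambda row: row[1])

print(scores_list)     # [3, 1, 2, 4]
print(tuple_mutated)   # False
print(by_salary)       # [('Dev', 60000), ('Ben', 75000), ('Ana', 90000)]
```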
### Data Visualization
31. What are the best visualization types for categorical data?
32. How do you create a bar chart in Matplotlib?
33. What is the difference between a histogram and a bar chart?
34. When should you use a scatter plot?
35. What is the advantage of using Seaborn over Matplotlib?
36. What are the different types of visualizations available in Power BI?
37. How do you create a calculated field in Tableau?
38. How do you filter data in Power BI?
39. What is a heatmap and when should you use it?
40. How do you handle large datasets in visualization tools like Power BI and Tableau?
### Business & Case Study Questions
41. How would you analyze sales data to find key trends?
42. How do you measure the success of a marketing campaign?
43. How would you identify fraudulent transactions in an e-commerce dataset?
44. How would you forecast product demand for the next quarter?
45. What metrics would you track for an e-commerce business?
46. How do you approach solving a churn analysis problem?
47. How would you handle inconsistent data from multiple sources?
48. How do you communicate technical insights to non-technical stakeholders?
49. What are KPIs? Can you give an example for a SaaS company?
50. How would you analyze ad performance for an online advertising campaign?