The course trains the students in statistical methods using the open-source software R. This course will help students to do data analysis, visualization, and apply statistical techniques on large-scale datasets such as Periodic Labour Force Survey, Annual Survey of Industries, and NSS data. Students will also learn the J‑PAL’s Optimal Design Software for power calculations. The course provides instruction on how to clean data from large scale household surveys and conduct preliminary analyses in R. The students would do exploratory data analysis on different large-scale datasets. Students would learn probability theory and hypothesis testing by simulating probability distributions. An introduction to simple linear regression and survey design methods would be covered in the course.