MWF 1:15-2:05, NVIDIA Auditorium

The course has an email list that reaches all TAs as well as the professor: stats202-aut1213-staff@lists.stanford.edu.

*All questions should be directed to this email list, rather than TA or the instructor.*

- Austen Head
- Office: Sequoia Hall #208
- Office hours: M 2:30-4:30

- Murat Erdogdu
- Office: Sequoia Hall #206
- Office hours: TTh 9:45-10:45

- Linxi Liu
- Office: Sequoia Hall #227
- Office hours: MW 10:00-11:00

- Introduction to Data Mining, Tan, Steinbach & Kumar.
*Required*

- Introduction to Statistics with R, Dalgaard.
- Elements of Statistical Learning, Hastie , Tibshirani & Friedman (A more statistically advanced treatment of most of the topics.)
- Data analysis and graphics using R: an example-based approach, John H. Maindonald, John Braun.

We will be using R for most examples, to demonstrate some examples. You can read about it here. Another language / environment is the scipy environment.

I will be using R from the ipython notebook so as to record the sessions in class. To run the notebooks, one can use the Enthought Python Distribution. The output will be posted on the webpage for use, even without running it in the notebook.

Some familiarity with linear algebra and statistical methods.

- homework (6 total, best 5 count); 60%
- midterm (W 10/31 in class); 15%
- final exam (according to Stanford calendar: W 12/12 @ 8:30AM); 25%
- Kaggle competition – possibility for a 20% bonus on final exam

- The worst of the 6 homework grades will be dropped.
- The homework assignments and solutions will be posted on the coursework web page, though perhaps with some delay after being posted on this webpage.
- Homework is to be submitted electronically as a PDF file in the dropbox on coursework https://coursework.stanford.edu/portal/site/F12-STATS-202-01.
- Each homework should be a single PDF following the pattern: “
*LastName_FirstInitial_HW1.pdf*”, “*LastName_FirstInitial_HW2.pdf*”. For example, my 4th homework would be called “*Taylor_J_HW4.pdf*”. - Include your name in the PDF file.
- Include copies of your code in the PDF file.
- Only submit one PDF file for each homework.
- You may discuss homework problems with other students, but you have to prepare the written assignments yourself. This means you are not to copy other students’ code or other parts of their homework.
- Homework will be timestamped based on when it is emailed to the staff mailing list. Each homework will have a specific deadline – late homeworks will be penalized 10 points.
- SCPD students should also CC their submissions to scpd-distribution@lists.stanford.edu

- Following the Stanford calendar: Wednesday, December 12 @ 8:30AM.
- If you cannot take the exam at that time and day, then you will have to take this class in a different quarter. Exceptions will only be made due to official university affairs, such as athletic commitments.
- Take a look at last year’s final for practice. This was assigned by Prof. Holmes, so will be slightly different than what we have covered.
- Here is another practice which was the final exam last year.

- Here is a practice midterm exam from last year. This was assigned by Prof. Holmes, so will be slightly different than what we have covered.
- Here is a second practice midterm which was last year’s midterm.