IYSE 6501 Review - Sp 2021
This is an absolutely fantastic course. If you have to take a single course in the entire AI/Analytics/ML areas at Ga Tech - this is the course. It covers everything and gives you a framework for further exploration. It is a little slanted towards a Statistics practice (than say AI/Comp. Science), but there is overlap (I have taken AI, and some topics are in both classes). The "unofficial textbook" is Intro to Statistical Learning. But you do not need it. I read the first few chapters and the chapter on SVMs, and the material in the text is much more advanced than the course expectations. I stopped reading it after the second week (but it is a summer goal to read the entire text. Oh, the ambitions of a simple man!).
Stick with the videos, Piazza, working on homework, and the "knowledge checks," which are little quizzes after the week's videos.
The Big Picture
This course is exceptionally well organized - from the videos to the homework and even the instructors on Piazza. I will cover each one in a little more detail, but here is a summary of what to expect.
This course is harder than it may seem at first, and it can sneak up on you. The pace starts out slow, and it is easy to fall behind. By the time you have to take the first midterm, you realize there's a ton of material to prepare for the first midterm. The second midterm is less than a month after the first, and the work deliverables keep comping. The Project is due right after/around the second midterm, and you still have homework due. It can be a lot to balance. My advice is to keep on top of the assignments and videos and try not to fall behind - catching up can be a little challenging. Easier said than done.
The curve seems pretty high, but they do not publish it. My guess is that most people end up with a B, but you can get an A if you are really good. The midterms carry most of the points (about 75%), with the remaining 25% divided between the Project (7%) and the homework (15%).
Midterms
Midterms are long and tricky. They are fair, but some of the questions are wordy and nuanced with lots of build-up with background context. You have to read them carefully because they want you to choose between subtle features in various models, etc. They allow you to have a cheat sheet. For the most part, I rarely used it - but YMMV.
The best way to study - I think - is to watch the videos. The videos are excellent. A previous reviewer complained that Dr. Sokol had a monotonous delivery, but I disagree. I actually liked his presentation style. It was very clear, well thought out, and Dr. Sokol has a fine delivery. I watched them about three times total and learned something each time. But the enjoyability of the videos is clearly a subjective assessment.
The final is cumulative - 65 questions and 3 hours. It really is cumulative, and there is a lot of material to choose from. Towards the end of the class, there is a new question type that occurs. The new question types ask you to apply the models you have learned during the course - why did this one work, and why will this maybe not work. You have to know your stuff.
Homeworks
Homeworks are R-programs that cover the topic of the week. Some of the problems were actually pretty difficult, but the grading is very generous. Also, the TAs hold a session where they walk you thru each homework and help you set it up. The point is to get a hands-on feel to the theory. Assignments are graded by your peers, and as long as you give it your best shot, the grading is generous. The Project is like a more extensive homework assignment, and they ask you to apply analytics to a public problem you can choose from.
Piazza
The instructors and TAs are sensational! I did not realize this until the latter half of the course, but you can ask any question you have on the material on Piazza, and the instructors WILL answer all your questions - every single time. And they really do a great job. There were many great conversations on Piazza with great student and instructor involvement. Piazza was a great resource, especially for exam prep and The Project.
Summary
This course lays the foundation for a career in Data Science. It covers a lot of material in sufficient enough detail to know how to get started on a problem. This was my first class from the IYSE side of the Ga Tech OMSC program, and I must say - if other courses in IYSE are similar in quality, I regret not applying for the Analytics Master instead of the Computer Science ML masters. (Both are good, I just really learned a lot in this course).