slrjuQztbCEZEBEALppGwQ==2025-04-15T02:45:00Zspring 2025
TL;DR - Five Stars. Incredible course, low workload, easy A, and good option to pair with a heavier course if you're already experienced. Take early in your program, it's HIGHLY applicable to pretty much any DS role.
I'm wrapping up the new DAB with Dr. Lizhen Xu this semester. I don't have any experience with the previous version for comparison, but based on what I've heard about it many people complained that they felt like it didn't teach useful material and that the group project was a PITA. I heard it was just something you had to get through. I'm happy to report that this new revamped course with a new instructor is nothing like that. The new version with Dr. Xu has been an absolute pleasure and I'm sad with how quickly it was over. For context, I'm a full-time data scientist with a decade of experience. While I didn't learn much from this course, I found it FAR superior than other courses and material introducing these topics that I saw when I was starting out and/or what my organization currently uses to train analysts.
Structure: This course is taught in R and in a Spring/Fall semester you have homework due every two weeks that is split into two parts. There are recorded lectures to watch that comprehensively prepare you for the assignments. You complete the homework offline and then take a quiz. There are no group projects nor any large proctored exams. The workload is pretty light if you already have a bit of experience with R and data analysis. Other than watching the lectures during my commute & gym time, I've spent 1-2 hours a week on this course. If you're a newbie, you can expect to spend ~10 hours per week according to reports from classmates. Each module is a different topic. The course starts with linear regression, then into some more nuanced types of regression/classification models, then into unsupervised learning and text mining, and then finishes with a fun baby introduction into neural networks and deep learning (you will review the code for a functioning Convolutional Neural Network that you can run if your hardware is sufficient). This class builds naturally from 6501 both in content and technology used, but you could take it as your first course in the program as Dr. Xu builds up your R knowledge from zero. While the NN/DNN content isn't deep enough to really give you a new skill, the rest of the topics are covered well and should immediately result in new tools in your toolbox. If you've been in the career-field for awhile, you will still probably pick up some tidbits you didn't know before.
Homework and Quizzes: Each homework is completed offline and then you take a non-proctored quiz that asks you two types of questions. The first type of questions are concept based and pretty much free. They ask you to interpret code or terms, fill in code snippets, or interpret the results of your analysis. The second type of questions require you to correctly identify the output of a certain step of the homework. These can be tricky due to both the large attention to detail required to not select decoy wrong answers, but also because they are entirely dependent on completing the assignment correctly at earlier steps. Errors made early in your code can carry forward and cause you to get multiple wrong answers later. In the future, I think the homework should be done in a Vocareum notebook like iCDA so that students can test their variables before moving on. Currently, there's no easy way to tell if your output is wrong other than not finding one of the four possible answers in your output. The first homework is worth 8% of your grade and the rest are worth 13% each.
Lectures: These are prerecorded and split into two types. The first is the presentation of the material via voiced over power point slides that are common in this program/format. They are excellent. Dr. Xu structures the information well both in terms of how the slides are designed and the amount of content, review, and pacing. While this in an intro course, he does a good job including deeper details like the formulas for distributions and models to keep advanced students interested. He has a slight accent, but not enough to cause anyone troubles understanding him, especially since he is one of the best oral presenters I've encountered in school or my career. Dr. Xu not only clearly communicates the content of each module, his tone and pace keep the lectures interesting. The second type of lecture is Dr. Xu coding and analyzing data similar to the assigned homework. While you can't get the exact answers for the quiz , you can clearly see step-by-step how to accomplish every task along with the exact code he used. Just like the formal lectures, these videos are well narrated as Dr. Xu explains what he is doing and why. These are good enough to maybe even take the crown from Dave Goldsman as top lecturer in the program.
Office Hours: I haven't been able to attend most of these, but I've watched the recordings of most of them. First, the man himself held office hours every week. You actually get to directly interact with the instructor! This was a welcome change from other courses I've taken. Dr. Xu is not only incredibly helpful during these sessions, but quite personable and entertaining. The TAs host a homework review/prep every week as well. Just like every other course I've taken in the program so far, the TAs do a fantastic job helping students out and providing tips and tricks.
Complaints: Like I mentioned above, I wish there was a built-in mechanism for students to check their output before the quiz and identify their errors. I didn't personally have issues, but many of the newer students did and then you have to walk the fine line between helping them with their code and academic integrity violations. My only other complaint is that this course was so light. While I recognize this in an introductory course, I wish it wasn't. I feel like 6501 already does a fine job introducing people to OMSA , and I would prefer that Dr. Xu add another 80%-120% of material so that this mandatory course is more useful to already practicing students. Everyone will have their own opinion on this obviously, and I recognize that I'm wishing that the course was something different than what's it's designed to be.
Conclusion: I can't possibly recommend this course enough both to novices and adepts. It's easy, engaging, and most importantly, useful. I suggest taking this as early in your degree as possible, and consider pairing it with another course as the workload is so incredibly low.
Rating: 5 / 5Difficulty: 1 / 5Workload: 4 hours / week