Welcome to the repository for DSCI 553 - Foundations & Applications of Data Mining, taught by Professor Wei-Min Shen at USC during the Spring 2024 semester. This central hub contains all coursework materials, including assignments and project solutions, organized into folders by course module.
Tip
Before exploring the materials, take a moment to review the license and the disclaimer so that you use them responsibly. The repository covers a range of Data Mining topics through hands-on assignments and projects.
- Course Name: DSCI 553 - Foundations & Applications of Data Mining
- Instructor: Prof. Wei-Min Shen
- Semester: Spring 2024
Feel free to explore the assignments, projects, and solutions provided as learning aids. Whether you're a beginner or an experienced practitioner, this repository aims to be your companion in learning the fundamentals of data mining within Data Science & Engineering. Happy learning!
Caution
Please note that this repository is a reference guide, intended as a tool for learning and comprehension. Do not use it for plagiarism. Use the material here to deepen your understanding and build your skills in Data Mining.
Assignment | Topic Covered | Grade |
---|---|---|
HW 1 | Data Exploration of Yelp Reviews Dataset with Spark RDD | 7/7 |
HW 2 | Implement SON Algorithm to find Frequent Itemsets using Spark and exploration of Ta Feng Dataset | 7/7 |
HW 3 | Build Hybrid Recommendation systems integrating Item-based Collaborative Filtering and Model-based approaches using XGBRegressor | 7/7 |
HW 4 | Building Graphs and Community Detection based on GraphFrames and Girvan-Newman algorithm | 7/7 |
HW 5 | Data Streaming Analysis - Bloom Filter, Flajolet-Martin, and Reservoir Sampling | 7/7 |
HW 6 | Clustering using Bradley-Fayyad-Reina (BFR) algorithm on synthetic dataset | 7/7 |
Competition | Recommendation System on Yelp Reviews Dataset | 8/8 |
Quizzes | PDF documents with question bank for quizzes | - |
Note
Overall Grade: A-
- USC DSCI 553 Fall 2023 - rutujabhandigani/DSCI553-Data-Mining
- USC DSCI 553 Fall 2022 - CyL97/DSCI-553
- USC DSCI 553 Fall 2021 - Shayne-Yang/DSCI_553
- USC DSCI 553 Spring 2021 - pohann/DSCI553
- Kayvan Shah | MS in Applied Data Science | University of Southern California
This repository is licensed under the BSD 3-Clause License. See the LICENSE file for details.