|
|
| BIL3020 | Introduction to Data Science | 3+0+0 | ECTS:4 | | Year / Semester | Spring Semester | | Level of Course | First Cycle | | Status | Elective | | Department | DEPARTMENT of COMPUTER ENGINEERING | | Prerequisites and co-requisites | None | | Mode of Delivery | | | Contact Hours | 14 weeks - 3 hours of lectures per week | | Lecturer | Dr. Öğr. Üyesi Murat AYKUT | | Co-Lecturer | | | Language of instruction | Turkish | | Professional practise ( internship ) | None | | | | The aim of the course: | | The course intends to teach the students for the fundamentals of data science, data preprocessing operations, data reduction methods, learning approaches, and data visualization techniques with practical code examples. |
| Learning Outcomes | CTPO | TOA | | Upon successful completion of the course, the students will be able to : | | | | LO - 1 : | Learns the basic concepts of data science. | 1.1 | 1, | | LO - 2 : | Gain knowledge on data preprocessing, data reduction, and data augmentation methods. | 1.1 | 1,3, | | LO - 3 : | Gain knowledge on learning from data approaches. | 1.1 | 1,3, | | LO - 4 : | Gain knowledge on evaluation approaches of learning methods. | 1.1 | 1, | | CTPO : Contribution to programme outcomes, TOA :Type of assessment (1: written exam, 2: Oral exam, 3: Homework assignment, 4: Laboratory exercise/exam, 5: Seminar / presentation, 6: Term paper), LO : Learning Outcome | | |
| Introduction; Data Types; Data Preparation; Dealing with Missing Values; Dealing with Noisy Data; Data Reduction; Data Augmentation; Feature Selection; Instance Selection; Outlier Removal; Discretization; Supervised Learning; Regression Modeling; Unsupervised Learning; Model Evaluation; Association Rules; Data Summarization and Visualization. |
| |
| Course Syllabus | | Week | Subject | Related Notes / Files | | Week 1 | Introduction, Data Types | | | Week 2 | Data Preprocessing, Missing Value, Noisy Data | | | Week 3 | Data Reduction: Feature Selection, Feature Extraction | | | Week 4 | Data Reduction: Case Reduction, Feature Discretization | | | Week 5 | Data Augmentation | | | Week 6 | Outlier Removal | | | Week 7 | Supervised Learning: Logistic Regressiob, kNN, Decision Trees | | | Week 8 | Supervised Learning: Naive Bayes, SVM, Ensemble Learning | | | Week 9 | Midterm exam / Homework | | | Week 10 | Regression Modelling | | | Week 11 | Unsupervised Learning: k-Means, Expactation-Maximization, Hierarchical Clustering | | | Week 12 | Model Evaluation | | | Week 13 | Association Rules: Apriori, FP-Growthi Collaborative Filtering | | | Week 14 | Fundamentals of Text Mining | | | Week 15 | Data Summarization and Visualization | | | Week 16 | Final exam | | | |
| 1 | Larose, C. D., Larose, D. T. 2019; Data Science Using Python and R, Wiley Publishing, 256 pages.
| | | |
| 1 | Garcia, S., Luengo, J., Herrera, F. 2015; Data Preprocessing in Data Mining, Springer, 320 pages. | | | 2 | Igual, L., Seguí, S. 2017; Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications, Springer, 218 pages. | | | |
| Method of Assessment | | Type of assessment | Week No | Date | Duration (hours) | Weight (%) | | Homework/Assignment/Term-paper | 11 | | 2 | 50 | | End-of-term exam | 15 | | 2 | 50 | | |
| Student Work Load and its Distribution | | Type of work | Duration (hours pw) | No of weeks / Number of activity | Hours in total per term | | Yüz yüze eğitim | 3 | 14 | 42 | | Sınıf dışı çalışma | 3 | 14 | 42 | | Ödev | 8 | 1 | 8 | | Dönem sonu sınavı için hazırlık | 6 | 1 | 6 | | Dönem sonu sınavı | 2 | 1 | 2 | | Total work load | | | 100 |
|