Türkçe | English
GRADUATE INSTITUTE of HEALTH SCIENCES / DEPARTMENT of BIOSTATISTICS and MEDICAL INFORMATICS
Biostatistics and Medical Informatics-Doctorate
Course Catalog
https://www.ktu.edu.tr/tebad
Phone: +90 0462 3775680
SABE
GRADUATE INSTITUTE of HEALTH SCIENCES / DEPARTMENT of BIOSTATISTICS and MEDICAL INFORMATICS / Biostatistics and Medical Informatics-Doctorate
Katalog Ana Sayfa
  Katalog Ana Sayfa  KTÜ Ana Sayfa   Katalog Ana Sayfa
 
 

TBB6021Biological Databases and Datamining2+2+0ECTS:7.5
Year / SemesterFall Semester
Level of CourseThird Cycle
Status Elective
DepartmentDEPARTMENT of BIOSTATISTICS and MEDICAL INFORMATICS
Prerequisites and co-requisitesNone
Mode of DeliveryFace to face, Practical
Contact Hours14 weeks - 2 hours of lectures and 2 hours of practicals per week
Lecturer--
Co-Lecturer
Language of instructionTurkish
Professional practise ( internship ) None
 
The aim of the course:
The aim of the course is to provide the ability to integrate different types of biological data and databases and to analyze biological data. Moreover, the aim of this course is to provide the ability to create original databases on the basis of MySQL or SQLite using different types of biological data and analyze them with different packages in the R programming language. In this course, it will be used machine-learning methods such as Support Vector Machines and Multiple Regressions on experimental data to classify and predict gene function and regulation.
 
Programme OutcomesCTPOTOA
Upon successful completion of the course, the students will be able to :
PO - 1 : Identify different types of biological data and know biological databases.1,5
PO - 2 : Knowledge of database structure and database design using biological data.1,3,4
PO - 3 : To be able to analyze and evaluate gene function predictions and classifications in biological databases using machine learning methods.1,4,5
CTPO : Contribution to programme outcomes, TOA :Type of assessment (1: written exam, 2: Oral exam, 3: Homework assignment, 4: Laboratory exercise/exam, 5: Seminar / presentation, 6: Term paper), PO : Learning Outcome

 
Contents of the Course
The course is divided into three sections: 1) Introduction to MySQL and R 2) Introduction to different data types 3) Machine learning methods for data mining
 
Course Syllabus
 WeekSubjectRelated Notes / Files
 Week 1Biomolecules, bioinformatics, basic terminology
 Week 2Introduction to databases / Basic SQL: MySQL vs SQLite.
 Week 3Complex SQL queries / Using indexes
 Week 4Genome Databases (Browsers, Resources, File Formats) / Functional Annotations: (GO-terms) /Writing Functions in R
 Week 5Transcriptome Databases / Pathway and Gene Regulatory databases
 Week 6Protein Interaction Databases (Biogrid, String) / Building and Querying an Interaction Network Database
 Week 7Creating a database for integrating different Biological data types
 Week 8Mid-term Exam
 Week 9Differential Gene Expression / Correlation / Clustering
 Week 10Decision Trees / Installing RWeka
 Week 11Logistic Regression
 Week 12Evaluating predictions (GO-term enrichment and ROC curves)
 Week 13Article discussion
 Week 14General review
 Week 15Student presentations
 Week 16Final Exam
 
Textbook / Material
1Zvelebil M., Baum J. O. 2007 Understanding bioinformatics, Garland Science.
 
Recommended Reading
 
Method of Assessment
Type of assessmentWeek NoDate

Duration (hours)Weight (%)
Mid-term exam 8 1 30
Presentation 14 1 20
End-of-term exam 15 1 50
 
Student Work Load and its Distribution
Type of workDuration (hours pw)

No of weeks / Number of activity

Hours in total per term
Yüz yüze eğitim 4 14 56
Sınıf dışı çalışma 8 14 112
Arasınav için hazırlık 2 7 14
Arasınav 2 1 2
Ödev 10 10 100
Proje 2 6 12
Dönem sonu sınavı için hazırlık 2 16 32
Dönem sonu sınavı 2 1 2
Total work load330