HS 650 001 WN 2018
Overview
This open graduate course will provide a general overview of the principles, concepts, techniques, tools and services for managing, harmonizing, aggregating, preprocessing, modeling, analyzing and interpreting large, multi-source, incomplete, incongruent, and heterogeneous data (Big Data). The focus will be to expose students to common challenges related to handling Big Data and present the enormous opportunities and power associated with our ability to interrogate such complex datasets, extract useful information, derive knowledge, and provide actionable forecasting. Biomedical, healthcare, and social datasets will provide context for addressing specific driving challenges. Students will learn about modern data analytic techniques and develop skills for importing and exporting, cleaning and fusing, modeling and visualizing, analyzing and synthesizing complex datasets. The collaborative design, implementation, sharing and community validation of high-throughput analytic workflows will be emphasized throughout the course.
Prerequisites
You can view the General DSPA Prerequisites Links to an external site.. To ensure students are comfortable in this DSPA course, consider taking the self-assessment (pretest) Links to an external site. prior to enrolling in the course.
To summarize, students should have prior experience with college level (undergrad) mathematical modeling, statistical analysis, or programming courses or permission of the instructor. Some MOOCs may be taken as prerequisites, e.g., Corsera Links to an external site., EdX1 Links to an external site., EdX2 Links to an external site.. Additional examples of remediation courses are provided in the self-assessment (pretest) Links to an external site..
Course Objectives
Trainees successfully completing the course will:
- Gain understanding of the computational foundations in Big Data Science
- Develop critical inferential thinking
- Gather a tool chest of R libraries for managing and interrogating raw and derived, observed, experimental, and simulated big healthcare datasets
- Possess practical skills for handling complex datasets.
Target Audience
This course will be appropriate for trainees who have significant interest in learning data scientific and predictive analytic methods that can commit substantial amount of time to focus an undivided attention to study, practice and interact with other trainees in the course.
Notes
Instructor
Ivo D. Dinov Links to an external site.
Competencies
Logistics
Acknowledgments
The DSPA course is made possible with substantial support from Michigan Institute for Data Science (MIDAS) Links to an external site., Statistics Online Computational Resources (SOCR) Links to an external site., Health Behavior and Biological Sciences (HBBS/UMSN) Links to an external site., and Computational Medicine and Bioinformatics (DCM&B) Links to an external site..
Fair Use Licensing
Like all SOCR resources Links to an external site., and to support open-science, these resources (learning materials and source-code) are CC-BY Links to an external site. and LGPL Links to an external site. licensed.
Course Summary:
Date | Details | Due |
---|---|---|
Fri Jan 19, 2018 | Assignment HW_Project_1 | due by 11:59pm |
Fri Feb 2, 2018 | Assignment HW_Project_2 | due by 11:59pm |
Fri Feb 16, 2018 | Assignment HW_Project_3 | due by 11:59pm |
Fri Mar 9, 2018 | Assignment HW_Project_4 | due by 11:59pm |
Sat Mar 24, 2018 | Assignment HW_Project_5 | due by 11:59pm |
Fri Apr 6, 2018 | Assignment HW_Project_6 | due by 11:59pm |
Fri Apr 20, 2018 | Assignment Term Paper Project | due by 11:59pm |
