HW6
- Due Nov 27, 2020 by 11:59pm
- Points 100
- Submitting a file upload
Homework Project 6
- Due Fri, Nov. 27, 2020
- Homeworks, projects and assignments
- Homework Submission Rules
- Homework Headers
Problem 6.1 (Network data visualization)
- Use the "Les Miserables" dataset (03_les miserablese_GraphData.txt)
- Visualize the character network as an undirected graph
- Summarize the graph and explain the output
- Calculate the degree and the centrality of this graph
- Find some important nodes (corresponding to key novel characters)
- Will the results change if we assume the graph is directed?
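As a rough illustration of the degree and centrality step, here is a minimal Python sketch. It assumes the graph is given as a plain list of node pairs with no duplicate edges or self-loops. The toy edge list and the helper names (`undirected_degree`, `directed_degree`, `degree_centrality`) are hypothetical and are not taken from the dataset or from any course material. The same pairs are counted twice, once as undirected edges and once as source-to-target links, so the effect of the directed assumption is visible.

```python
from collections import defaultdict

# Toy edge list (hypothetical; NOT taken from the Les Miserables file).
edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]

def undirected_degree(edge_list):
    """Count each edge once for both of its endpoints."""
    deg = defaultdict(int)
    for u, v in edge_list:
        deg[u] += 1
        deg[v] += 1
    return dict(deg)

def directed_degree(edge_list):
    """Treat each pair as source -> target; return (in-degree, out-degree)."""
    out_deg, in_deg = defaultdict(int), defaultdict(int)
    for u, v in edge_list:
        out_deg[u] += 1
        in_deg[v] += 1
    nodes = set(out_deg) | set(in_deg)
    return {n: (in_deg[n], out_deg[n]) for n in sorted(nodes)}

def degree_centrality(deg):
    """Normalize degree by the largest possible neighbor count, n - 1."""
    n = len(deg)  # assumes at least two nodes
    return {node: d / (n - 1) for node, d in deg.items()}

deg = undirected_degree(edges)
cent = degree_centrality(deg)
in_out = directed_degree(edges)
top = max(cent, key=cent.get)

print(deg)
print(cent)
print(in_out)
print("most central:", top)
```

In this toy graph, node C has undirected degree 2 but out-degree 0 once the pairs are read as directed links, so a ranking based on out-degree alone would differ from the undirected one.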
Problem 6.2 (Feature Selection)
Use the 06_PPMI_ClassificationValidationData_Short dataset
- Set ResearchGroup as the class variable
- Delete irrelevant columns (e.g. X, FID_IID) and select only the PD and Control cohorts, as the ResearchGroup feature includes PD, Control and SWEDD (which are mild cases of PD)
- Convert the variable types as necessary
- Compare alternative feature selection methods: Boruta, Recursive Feature Elimination (RFE), and stepwise feature selection
- Summarize and visualize the results
- Report and compare the variables selected by these methods. How much overlap is there in the selected salient features?
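One possible way to quantify the overlap between the selected feature sets is the Jaccard index, computed for each pair of methods, together with the set of features chosen by all methods. The sketch below is only an illustration: the feature names `f1` to `f6` and the three selections are placeholders, not results from the PPMI data, and the `jaccard` helper is a hypothetical name.

```python
from itertools import combinations

# Placeholder selections (hypothetical feature names, NOT actual results).
selected = {
    "Boruta": {"f1", "f2", "f3", "f4"},
    "RFE": {"f2", "f3", "f5"},
    "stepwise": {"f2", "f4", "f5", "f6"},
}

def jaccard(a, b):
    """Intersection size divided by union size (0.0 if both sets are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

# Jaccard index for every pair of methods.
pairwise = {
    (m1, m2): jaccard(selected[m1], selected[m2])
    for m1, m2 in combinations(selected, 2)
}

# Features that every method selected.
common = set.intersection(*selected.values())

for pair, score in pairwise.items():
    print(pair, round(score, 2))
print("selected by all methods:", sorted(common))
```

A Jaccard index of 1 means two methods picked identical features and 0 means they share none, which gives a single comparable number per pair alongside the list of commonly selected variables.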
Rubric
Criteria | Ratings | Pts
---|---|---
Correctness and scientific validity | | --
Result reproducibility | | --
Content focus, presentation style, and clarity | | --
Total Points: 100