tlenaert / elixir-ml-biostats-hackathon

Working repo for the ELIXIR lesson hackathon on Machine Learning and Biostatistics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

License: CC BY 4.0

Biostatistics and Machine Learning Lesson Hackathon

Working repo for the ELIXIR lesson hackathon on Machine Learning and Biostatistics

Hackathon Leads

in alphabetical order

  1. Shakuntala Baichoo (University of Mauritius, H3ABioNet)
  2. Alexandros Dimopoulos (Institute for Fundamental Biomedical Science, BSRC Al. Fleming / ELIXIR - GR)
  3. Wandrille Duchemin
  4. Van Du T. Tran (Vital-IT, SIB Swiss Institute of Bioinformatics)
  5. Pedro L. Fernandes (Instituto Gulbenkian de Ciência / ELIXIR-PT)
  6. Alireza Khanteymoori
  7. Anmol Kiran (Malawi-Liverpool-Wellcome Trust, H3ABionet)
  8. Claudio Mirabello (Linköping University, NBIS)
  9. Fotis Psomopoulos (Institute of Applied Biosciences, Centre for Research and Technology Hellas / ELIXIR-GR)
  10. Bengt Sennblad (Uppsala University, NBIS)

Timeline

Time Scope
Before day #1 of Hackathon Setup GitHub repo:
- CC-BY license
- Lesson to be published in zenodo
- Leads as "lesson" editors
- All contributors as co-authors

Lesson Preparation:
- Collate material from similar/relevant events
- Provide examples of Learning Objectives, Learning Outcomes and Learning Profiles

Outreach & Engagement:
- Date to be communicated to the ELIXIR Training Coordinators mailing list, the ELIXIR Machine Learning Focus Group mailing list, H3ABioNet, etc
Day #1 of Hackathon Goals are:
- Specify learners’ profiles (i.e. target audience)
- Specify Learning objectives and,
- Specify Learning outcomes
Near the end of the day, decide on the date of Day #2
Before day #2 of Hackaton - Reformat / Restructure all notes
- Create a skeleton structure of the material (pending approval)
Day #2 of Hackathon Goals are:
- confirm structure and flow of the content
- Sprepare the first set of hands-on exercises that target specific learning outcomes (one or more)
- Identify potential timeline for final release

Day #1

Resources

Existing material

@all: feel free to add related content you are already aware of here:

content should be open (i.e. under CC license), and practical (i.e. hands-on exercises, rather than slides)

  1. SIB "Introduction to Machine Learning" course
  • Notes: Practical ML using R. Introductory level. Requires programming knowledge in R (dplyr, tidyverse). Covers Exploratory Data Analysis, k-means, hierarchical clustering, decision trees, random forests, linear regression, glm
  1. BC2 Introduction to Machine Learning
  • Notes: Practical ML using R. Introductory level. Requires programming knowledge in R (dplyr, tidyverse). Covers Exploratory Data Analysis, k-means, hierarchical clustering, decision trees, random forests, linear regression, glm
  1. Galaxy Training Material on Statistics and machine learning
  • Notes: Practical ML using Galaxy. Covers a wide range of topics (Supervised, Unsupervised, Regression, Classification, Deep Learning). Does not offer any coding aspects.
  1. Cambridge Coding Academy GitHub repo (Cambridge Coding Academy website)
  • Notes:
  1. Cambridge bioinformatics training GitHub repo (Cambridge Coding Academy website)
  1. esci web dances

  2. seeing theory

  3. Introduction to Probability and Statistics for Engineers

  4. NBIS Workshop in Biostatistics and Machine Learning

  • Notes: Contains the teaching material (lectures/tutorials/labs) for the different course sessions, see link to course info and schedule for how these sessions are structured in the course. This workshop will be given in November 2020 and material above is in the process of being updated from the 2019 workshop and reformatted into a common format, in other words Work in progress.
  1. NBIS Workshop in Omics Integration
  • Notes: Contains the teaching material (lectures/tutorials/labs) for the different course sessions, see link to course info and schedule for how these sessions are structured in the course. This workshop is currently being given (October 2020) and material above is still being updated, in other words Work in progress

Lesson templates

  1. The Turing Way
  1. The Carpentries Template
  • Notes: Custom created template.
  1. The Open Science Training Handbook
  1. Open Science Open Data Open Science Book

Curriculum Development

Reverse Instructional Design

  • based on research summarised in The Carpentries Instructor Training
  • determine target audience
  • determine practical skill(s) they need to learn
  • create concept maps to guide curriculum design
  • design exercises to assess learner progress towards objectives*
  • write connecting material*

* I don't expect us to reach this point during day #1

Learner's Profiles

To teach effectively, you have to know who you are teaching. Your audience can be identified in many ways. Frequently people who are hosting a workshop have a specific audience in mind, based on their own experience.

One "creative" way to think deeply about the audience for a workshop is to take a few moments to write learner profiles. Learner profiles have three parts: the person’s general background, the problem they face, and how the course will help them. One example of a learner profile for a Software Carpentry workshop might be:

Learner Profile Example (copied from the The Carpentries Instructor Training curriculum)

João is an agricultural engineer doing his masters in soil physics. His programming experience is a first year programming course using C. He was never able to use this low-level programming in his activities, and never programmed after the first year.

His work consists of evaluating physical properties of soil samples from different conditions. Some of the soil properties are measured by an automated device that sends logs in a text format to his machine. João has to open each file in Excel, crop the first and last quarters of data values, and calculate an average.

Software Carpentry will show João how to write shell scripts to count the lines and crop the right range for each file, and how to use R to read these files and calculate the required statistics. It will also show him how to put his programs and files under version control so that he can re-run analyses and figure out which results may have been affected by changes.

What Are The Learning Outcomes?

Once you have an idea of your intended audience, the next step is to think through the goals for your workshop. These goals are usually communicated through learning outcomes. We need to be realistic about what audience can learn in the time available (1-2 days). Having specific outcomes helps avoid creep in lesson scope. Bloom's Taxonomy can help with this:

"By the end of the [lesson|section], learners will be able to..."

Bloom's Taxonomy

Concept Maps

Our tool of choice for representing someone’s mental model is a concept map, in which facts are bubbles and connections are labeled connections.

Link from the Carpentries Instructor Training

Learning Outcomes

Objectives vs. Outcomes

A learning objective is what a lesson strives to achieve. A learning outcome is what it actually achieves, i.e. what learners actually take away. The role of summative assessment is therefore to compare learning outcomes with learning objectives.

From the Teaching Tech Togehter handbook

About

Working repo for the ELIXIR lesson hackathon on Machine Learning and Biostatistics

License:Other