May 6, 9:00AM-May 8, 12:30PM : Visualization of biomolecular data
Description
In this course participants will explore key principles of creating and designing data visualizations with the ggplot2 graphing library in R. The module will use case studies with data from large-scale quantitative mass spectrometry-based proteomic experiments, however the design principles will more broadly apply. Specific topics include effective composition and layout of different visualizations, effective use of color, general strategies for working with different types of plots and charts, improving figure clarity, and techniques for visualizing multidimensional data. Through a mixture of lecture and hands-on activities, participants will be invited to consider the ways in which good design can help communicate the information.
Target audience
- Target audience are experimental scientists, bioinformaticians, computer scientists, data scientists, statisticians or engineers, interested in visualizing data in general, and quantitative proteomic data in particular. A minimal prior exposure to R (e.g., with the course ‘Beginner’s statistics in R’) is expected.
References
Claus O. Wilke, Fundamentals of Data Visualization: A Primer on Making Informative and Compelling Figures. O’Reilly Media, 2019. ONLINE version and PAPER version.
‘Points of View’ in Nature Methods
Speakers
- Steven Braun, Laurent Gatto, Nils Gehlenborg, Alexander Lex
Tentative schedule
Monday, May 6, 2019
- 8:00 a.m. Registration
- 9 a.m. Keynote, Alexander Lex
- 10:30 a.m. Refreshments
- 11:00 a.m. Intro to plotting with base R (from exploring raw data to summary of conclusions), Laurent Gatto
- 12:30 p.m. Lunch
- 1:30 p.m. Intro to ggplot2, Laurent Gatto
- 3:00 p.m. Refreshments
- 3:30 p.m. Lecture : Communicating scientific data through information design, Steven Braun
- 5:00 p.m. Viz critiques, Steven Braun
- 6:00 p.m. Adjourn
Tuesday, May 7, 2019
- 8:00 a.m. Q&A
- 9:00 a.m. Good visualization practice (transformations, MS plots, correlation, volcano), Laurent Gatto
- 10:30 a.m. Refreshments
- 11:00 a.m. R and Bioconductor for mass spectrometry, Laurent Gatto
- 12:30 p.m. Lunch
- 1:30 p.m. More visualization : upSetR, PCA, heatmap, Laurent Gatto
- 3:00 p.m. Refreshments
- 3:30 p.m. Practice, Laurent Gatto
- 4:00 p.m. Poster session, Meena Choi
- 6:00 p.m. Dinner at Back Bay Social dinners
Wednesday, May 8, 2019
- 8:00 a.m. Q&A
- 9:00 a.m. Keynote, Nils Gehlenborg
- 10:30 a.m. Refreshments
- 11:00 a.m. Interactive data visualization, Laurent Gatto
- 12:30 p.m. Wrap-up