Chapter 1 Welcome

Hello! This book aims to empower scientists with resources to use to learn computational and open data science skills. Often, it is hard to even know which skills you need! This book’s “choose your own pathway” style invites you to choose an intentional and well-worn path rather than creating one from scratch.

This book is not an exhaustive list of skill-building resources available but is highly curated, opinionated, and increasingly narrated to meet the needs of scientists that are new to coding and/or those looking to upgrade existing workflows. It is set up in a style that in ecology we call a “dicotomous key”, where depending to your answer to a prompt, you take yourself to a different part of the book. Resources center around open source languages and reproducible, collaborative workflows, including R, Python, Unix, and GitHub.

1.1 How to use this book

Use this book to help create your learning plan to setup new analytical workflows and/or streamline existing ones. It will direct you to existing tutorials, communities, or scripts for each step, encouraging you to think deliberately along the way so that you can incrementally improve your skills and workflows. For example, if your current workflow requires multiple software interfaces and languages, this book will challenge you to think about whether you should focus on learning a new language or software, or on streamlining how languages or software interoperate. The theme is to reduce friction as much as possible, so you can focus on science.

While you can use this book alone, it is much better to learn together (see Chapter 11!). Start with a colleague/friend; you can be “accountability buddies” for each other while you choose you own pathways together, and you can continue to be even as your paths diverge.

Read more in the Preface in Chapter 2 and then go to Chapter 3 to choose your own pathway.

1.2 About

We are developing this book specifically to meet the needs of biomedical scientists who are part of the Chan Zuckerberg Initiative’s Neurodegenerative Challenge Network — but it is designed for wide use across research domains. It was started by Julia Stewart Lowndes Openscapes and contributed to by Christopher J. Sifuentes with help from a growing number of NDCN contributors, including you. Demonstrating the power of open source not only for analysis but for communication, the book is written in R (bookdown) and published through GitHub.

Contributors include:

  • NDCN Computational Advisors
  • NDCN Office Hours participants
  • Justin Savage
  • Oliver Tam

1.3 Contributing

This imperfect and incomplete book is a work in progress. But it is available openly as a resource while it is iteratively improved. Please help us improve! Do you have an idea for resources, improvements on text, or a better title for the book? (This book’s title is really not great. It aims to give the reader a sense of participation — without infringing on copyright — and a sense of R and Python.)

We welcome contributions: please edit the book or share resources that have helped you via email or GitHub. We are thankful for your contributions and will include them after considering whether they fit within our opinionated framework and audience needs.


Creative Commons License This book is licensed under a Creative Commons Attribution 4.0 International License.