Regarding that lab notebook… (Part 2: Is there a better way?)

6 minute read

Published: December 10, 2018

Searching, searching, for a better receptacle.

Curious about better notebook options and what other scientists are using, first I asked around my computational group colleagues. They generally do not use any specialised ELN either, but the usual OneNote, gDoc, pen and paper, and the likes.

Turning my queries online, I found this paper in PLoS Computational Biology <Ten Simple Rules for a Computational Biologist’s Laboratory Notebook>. Although it is more about how to document rather than what to use to document, it is helpful for oneself to identify what a lab notebook is to them. For example, do you regard your notebook to be a legal record? (You should, by the way, according to the paper). Do you want your notebook to be accessible to anyone in the world? (See open-notebook science). Another related paper you may want to check out, regarding folder structure: <A Quick Guide to Organizing Computational Biology Projects>.

I also came across this Nature article <How to pick an electronic laboratory notebook>, which from the get-go already assumes that traditional notebook is cumbersome and electronic lab notebook (ELN) is definitely the future, but there are so many ELN options – how to choose? (If you are interested, one of the references has a comprehensive list.)

I looked at the options mentioned in the article and well, I don’t know, somehow I am turned off by all the commercial options. I think the ideal ELN for science should be open-source like Jupyter notebook.

One workflow from the article resonates with me, though –

Every month, his team exports pages to PDF files and signs them electronically; the files are then moved to a directory where they cannot be changed.

I like it because it lets people to use any form of notebook they want (even pen and paper!), as long as it is exportable to PDF, and at the same time fulfilling audit requirement of timestamping and non-editability.

Version control as timestamping and more

I would submit that a better (but not necessarily easier) way to do this is to borrow a concept from software development, version control. In software development, the version control software keeps track of the source code of the project. It is a sort of more sophisticated version of Word’s track changes feature.

Another thing that we should get into version control...
Image credit: PHDComics

In our case, our notebook is the source code. Indeed, there are others who have the same idea. Googling ‘git lab notebook’ gives several hits, for example this one is centred around the calendar days and lives in GitHub. (The version control software that I am familiar with is git, so I will mention git a lot.)

I shall attempt to describe the git version control workflow: You work on a project which has features A and B; you stage the changes related to feature A and commit, thus timestamping and logging it in the timeline; you do the same for feature B. Staging is precisely for this purpose, to bundle together changes in a logically-structured manner. The version control software only saves the changed files (git saves the whole file, other softwares may save only the difference), so this is a good thing, storage-wise.

In this way you would accumulate a linear collections of timepoint snapshots (branching in the timeline is possible but let’s exclude it for simplicity), in which you have a record of every change in your notebook (isn’t this like… blockchain?). Even if you go back and revert some changes you make because of a mistake for instance, that is recorded as yet another commit.

Sounds arcane? Probably to the typical (non-computational) scientists, all this sounds cumbersome to learn and get into. Even for me, typing all those obscure git commands to the terminal take sometime to get used to.

Ideally, they (I’m not sure who… kind strangers?) should make git version control more user-friendly and simpler for the user (nice GUI; and we can also skip the staging for example) and for the auditor (they can see the whole timeline and there is a slider button to quickly review all changes), and I think many would gladly use it. The weakness of this approach is, the ELN needs to be largely plain text (just like a software source code), otherwise the changes between commits may be difficult to view.

Sounds like something you would use? Well, actually that thing I described above sort of exists already. If you install Anaconda, a popular Python distribution, Visual Studio Code (VSCode) is bundled inside. VSCode is an open-source code editor, and you know what, it has git built-in. You can stage and commit away with abandon without arcane terminal commands. I would suggest keeping the notebook in the form of Markdown files, since they are plain text, although you can do .tex files equally well. It has a lot of plugins, including a git history viewer. As a bonus, I can access the Linux terminal from within (I love my bash scripts) and there is a plugin that lets you load Jupyter notebooks as well!

You can see my VSCode setup in the screenshot below: VSCode screenshot

Take a closer look at the Markdown file in the screenshot… Yup, this blogpost itself is written with VSCode. Quite meta, isn’t it?

So I’m starting to slowly incorporate this VSCode setup into my workflow, and we will see if it gains traction.

To summarise, this is my current lab notebook setup:

Pen and paper: scratchpad stuff
Slides (one presentation per project): figures, schemes, narrative of the project, current result with timestamp
Jupyter notebook: graphs, data processing
Hourly log script: for self-tracking, probably will be redundant if using git
Trying out: Markdown files, editing with Visual Studio Code, version control with git (integrated in VSCode)

Actually, even after brain-dumping all this notebooking stuff, I’m still a bit fuzzy about the concept. I guess, for now, my lab notebook philosophy is: track as much as possible, with least effort possible.

Share on

Twitter Facebook Google+ LinkedIn

Your email address will not be published. Required fields are marked *

Link roundup: 2024

1 minute read

Published: December 31, 2024

Science
Chaos and cause

To fully appreciate what this means, heed a lesson from Fyodor Dostoyevsky’s great novel, The Brothers Karamazov (1880), which asks how a benevolent God could allow suffering. There is just one virtuous character in the novel, the monk Father Zosima, whose simple teaching, dictated through the genius of Dostoyevsky, sheds light on chaos, causation and difference-making:
See, here you have passed by a small child, passed by in anger, with a foul word, with a wrathful soul; you perhaps did not notice the child, but he saw you, and your unsightly and impious image has remained in his defenceless heart. You did not know it, but you may thereby have planted a bad seed in him, and it may grow, and all because you did not restrain yourself before the child, because you did not nurture in yourself a heedful, active love … for one ought to love not for a chance moment but for all time. Anyone, even a wicked man, can love by chance. My young brother asked forgiveness of the birds: it seems senseless, yet it is right, for all is like an ocean, all flows and connects; touch it in one place and it echoes at the other end of the world.

Link roundup: Jan–Jun 2023

less than 1 minute read

Published: June 30, 2023

Science
Go Ahead, Try to Explain Milk
Killer Heat Waves Are Coming
Science Shows Why Traditional Kimchi Making Works So Well
A New Approach to Computation Reimagines Artificial Intelligence
The Computer Scientist Peering Inside AI’s Black Boxes

Others
The Case Against Travel
How to Keep Life from Becoming a Parody of Itself: Simone de Beauvoir on the Art of Growing Older
Is Wine Fake?
The Sound of Home: Sonorous Desert by Kim Haines-Eitzen
The Meaning of Life
The Dao of Using Your Smartphone
Camus’s Atheism and the Virtues of Inconsistency
Fatal Distraction: Forgetting a Child in the Backseat of a Car Is a Horrifying Mistake. Is It a Crime?

Link roundup: Apr–Dec 2022

1 minute read

Published: December 31, 2022

Science
Wood spirits: How Japan made the world’s first liquor from trees
The price of ‘sugar free’: are sweeteners as harmless as we thought?
A language model beats alphafold2 on orphans
https://github.com/FellowsFreiesWissen/computational_notebooks Why Conventional Wisdom About Cancer Can Be Misleading
Machine Learning to Handle the Proteome
‘The entire protein universe’: AI predicts shape of nearly every known protein
Could machine learning fuel a reproducibility crisis in science?
Blots on a field?
PNAS | Leveraging nonstructural data to predict structures and affinities of protein–ligand complexes
Breaking into the black box of artificial intelligence

Others
When a Houseplant Obsession Becomes a Nightmare
Book Review: What We Owe The Future
If Someone Is Typing, Then Stops … Can I Ask Why?

Link roundup: Jan–Mar 2022

2 minute read

Published: March 30, 2022

Since these AIs are just giant matrix multiplication machines, “intuition” now has a firm grounding in math - just much bigger, more complicated math than the usual kind that we call “logical”.

This would be a common pattern for sciences: much worse at everyday tasks than people who do them intuitively, until it generates some surprising and powerful new technology. Democritus figured out what matter was made of in 400 BC, and it didn’t help a single person do a single useful thing with matter for the next 2000 years of followup research, and then you got the atomic bomb (I may be skipping over all of chemistry, sorry).
– What Are We Arguing About When We Argue About Rationality?

What he seeks to practice is, in a phrase popularized by the Marxist philosopher Antonio Gramsci, “pessimism of the intellect, optimism of the will.”
– Can Science Fiction Wake Us Up to Our Climate Reality?

Caulfield then introduced two different ways of thinking about how we engage with ideas when we’re on the internet: The web as a garden and the web as a stream. Think of the web as an organically developing garden: a space in which there’s no predetermined order or relationship of things to one another. Caulfield writes, “Every walk through the garden creates new paths, new meanings.” What came first in the garden doesn’t matter either. Each thing in the garden is related to the other things as it exists in the moment.
– The Faithful Gardener

Science
Dual use of artificial-intelligence-powered drug discovery
Twelve quick tips for software design
Computer Scientists Prove Why Bigger Neural Networks Do Better
Failing the test: DNA barcoding brought botanist Steven Newmaster scientific fame and entrepreneurial success. Was it all based on fraud?
What’s the buzz? Let’s talk about numbing ingredients
The pandemic’s true death toll: millions more than official counts
5 nutrition goals that are better than weight loss

Others
https://github.com/csinva/imodels
Synaesthetics
Transformative Experience and Pascal’s Wager
Do Good Doorbell Cams Make Good Neighbors?
How to Want Less
It’s Your Friends Who Break Your Heart
How to be useless
What We Don’t Want to Know
It’s Time for Some Game Theory
Why does woman have ‘man’ in it and female has the word ‘male’ in it?

Yossa Dwi Hartono