Documentation episode

1 minute read

Published: May 10, 2021

Software developers know this already but when working on the latest publication recently, I was reminded that we should document obsessively.

What was challenging in this project is, even though the amount of generated data is not massive (only about equivalent of thousands of spreadsheet rows), a lot of variables are involved in a complicated way. I briefly considered putting them in a pandas dataframe, but it proved to be tedious and impractical. Furthermore, there are many ways to slice it, and in the beginning it was not clear what was best. Different slices give slightly different result, and this is important because the result go to a testing pipeline. So we tried a couple of different slices, tested them, the pandemic happened, and some time later, uhh… which slices exactly did we do again? I thank my past self that although the folder is a bit messy, I retained all the raw data and all the bash scripts for each processing step was right there, from raw, right until beautiful publication figure. Although it did take a while to get my bearings again in the folder, thankfully I was able to retrace my steps again.

Here are some notable points to self:

Intermediary scripts should have checks built in. Expose the temporary files; don’t hide them in a black box.
Just text files is not so good for documentation. In this case I had a companion gSheets and jupyter notebook and they really help.
Put raw data and analysis in different folders (I didn’t do this)

Share on

Twitter Facebook Google+ LinkedIn

Your email address will not be published. Required fields are marked *

Link roundup: 2024

1 minute read

Published: December 31, 2024

Science
Chaos and cause

To fully appreciate what this means, heed a lesson from Fyodor Dostoyevsky’s great novel, The Brothers Karamazov (1880), which asks how a benevolent God could allow suffering. There is just one virtuous character in the novel, the monk Father Zosima, whose simple teaching, dictated through the genius of Dostoyevsky, sheds light on chaos, causation and difference-making:
See, here you have passed by a small child, passed by in anger, with a foul word, with a wrathful soul; you perhaps did not notice the child, but he saw you, and your unsightly and impious image has remained in his defenceless heart. You did not know it, but you may thereby have planted a bad seed in him, and it may grow, and all because you did not restrain yourself before the child, because you did not nurture in yourself a heedful, active love … for one ought to love not for a chance moment but for all time. Anyone, even a wicked man, can love by chance. My young brother asked forgiveness of the birds: it seems senseless, yet it is right, for all is like an ocean, all flows and connects; touch it in one place and it echoes at the other end of the world.

Link roundup: Jan–Jun 2023

less than 1 minute read

Published: June 30, 2023

Science
Go Ahead, Try to Explain Milk
Killer Heat Waves Are Coming
Science Shows Why Traditional Kimchi Making Works So Well
A New Approach to Computation Reimagines Artificial Intelligence
The Computer Scientist Peering Inside AI’s Black Boxes

Others
The Case Against Travel
How to Keep Life from Becoming a Parody of Itself: Simone de Beauvoir on the Art of Growing Older
Is Wine Fake?
The Sound of Home: Sonorous Desert by Kim Haines-Eitzen
The Meaning of Life
The Dao of Using Your Smartphone
Camus’s Atheism and the Virtues of Inconsistency
Fatal Distraction: Forgetting a Child in the Backseat of a Car Is a Horrifying Mistake. Is It a Crime?

Link roundup: Apr–Dec 2022

1 minute read

Published: December 31, 2022

Science
Wood spirits: How Japan made the world’s first liquor from trees
The price of ‘sugar free’: are sweeteners as harmless as we thought?
A language model beats alphafold2 on orphans
https://github.com/FellowsFreiesWissen/computational_notebooks Why Conventional Wisdom About Cancer Can Be Misleading
Machine Learning to Handle the Proteome
‘The entire protein universe’: AI predicts shape of nearly every known protein
Could machine learning fuel a reproducibility crisis in science?
Blots on a field?
PNAS | Leveraging nonstructural data to predict structures and affinities of protein–ligand complexes
Breaking into the black box of artificial intelligence

Others
When a Houseplant Obsession Becomes a Nightmare
Book Review: What We Owe The Future
If Someone Is Typing, Then Stops … Can I Ask Why?

Link roundup: Jan–Mar 2022

2 minute read

Published: March 30, 2022

Since these AIs are just giant matrix multiplication machines, “intuition” now has a firm grounding in math - just much bigger, more complicated math than the usual kind that we call “logical”.

This would be a common pattern for sciences: much worse at everyday tasks than people who do them intuitively, until it generates some surprising and powerful new technology. Democritus figured out what matter was made of in 400 BC, and it didn’t help a single person do a single useful thing with matter for the next 2000 years of followup research, and then you got the atomic bomb (I may be skipping over all of chemistry, sorry).
– What Are We Arguing About When We Argue About Rationality?

What he seeks to practice is, in a phrase popularized by the Marxist philosopher Antonio Gramsci, “pessimism of the intellect, optimism of the will.”
– Can Science Fiction Wake Us Up to Our Climate Reality?

Caulfield then introduced two different ways of thinking about how we engage with ideas when we’re on the internet: The web as a garden and the web as a stream. Think of the web as an organically developing garden: a space in which there’s no predetermined order or relationship of things to one another. Caulfield writes, “Every walk through the garden creates new paths, new meanings.” What came first in the garden doesn’t matter either. Each thing in the garden is related to the other things as it exists in the moment.
– The Faithful Gardener

Science
Dual use of artificial-intelligence-powered drug discovery
Twelve quick tips for software design
Computer Scientists Prove Why Bigger Neural Networks Do Better
Failing the test: DNA barcoding brought botanist Steven Newmaster scientific fame and entrepreneurial success. Was it all based on fraud?
What’s the buzz? Let’s talk about numbing ingredients
The pandemic’s true death toll: millions more than official counts
5 nutrition goals that are better than weight loss

Others
https://github.com/csinva/imodels
Synaesthetics
Transformative Experience and Pascal’s Wager
Do Good Doorbell Cams Make Good Neighbors?
How to Want Less
It’s Your Friends Who Break Your Heart
How to be useless
What We Don’t Want to Know
It’s Time for Some Game Theory
Why does woman have ‘man’ in it and female has the word ‘male’ in it?

Yossa Dwi Hartono

Documentation episode

Share on

Leave a Comment

You May Also Enjoy

Link roundup: 2024

Link roundup: Jan–Jun 2023

Link roundup: Apr–Dec 2022

Link roundup: Jan–Mar 2022