Picking up Python as a scientist

4 minute read

Published: December 03, 2018

My PhD supervisor prophetically decreed back in 2013 that “it might do thee some good to learn Python” (not his exact words).

I did heed his advice and looked up some tutorials but soon abandoned it since it was not directly relevant to what I was doing. Now that Python is all the buzz, I feel a tinge of regret for not persisting to learn Python. I do believe the ability to code is now essential for a scientist regardless of fields, but especially in biology where we encounter growing size of data. This Nature article <Programming: Pick up Python> lays out this argument in more details with some resources for you to begin picking up Python.

Exhibit A: Me picking up some python...

I did pick up Python starting late last year, right after my thesis defense, while waiting for my work visa. While it is true that many resources and tutorials are out there, since Python is a popular language, I often found that the tutorials are more geared towards software developers. Scientists often do not need the full-blown Python capabilities or follow certain style guidelines, since we often write small scripts with fewer than 100 lines, instead of a big software.

So here I am listing some points and good practices I pick up from learning Python that are good to know, and mention some advanced features that you can skip from tutorials if you just want to pick up Python quickly. Some of the points are not Python-specific, but good coding practices and/or ‘philosophy’ of sorts (try googling Zen of Python, for example).

Learning points for scientists from coding:

Code documentation
I often opened my old script files and took a few minutes to figure out what on earth I was doing. For scripts, code comments are often enough for documentation (here is a good commenting guide). But don’t overcomment, e.g.:
```
# assign value to counter -> redundant comment
counter = 0
```
Furthermore, I think scientists can learn from the way software developers do their documentation, specifically applied to our lab notebook. One needs to deliberately devote time and effort to document. I will elaborate on lab notebooking in another blogpost.
Code review and refactoring
This is related to the previous point. If you write readable and well-structured code, sometimes comments become unnecessary. Code has to be revisited and revised, not only for the logic, but for structure.
Naming files, variables, functions
Related to code readability. Variable name user_id is self-explanatory compared to x.
Dynamic typing; object types: int, float, str
Conditional and loop statements
Often these make up the bulk of the logic that you need from a script, so be sure to know them well.

Functions and abstractions
View your script in modular fashion: break them to steps and tuck each step in a function. This way of thinking is powerful to solve a big problem by breaking it down to manageable smaller problems.

# bash script
    
# the inner workings of do_thing_a and do_thing_b are not shown (abstracted): 
# - overall logical structure becomes clear
# - easy to comment out
main(){
do_thing_a
#do_thing_b # skip the whole function with line, instead of block, comment
}
  
do_thing_a(){
...
}
  
do_thing_b(){
...
}
  
main

File I/O (opening, reading, and writing file)
Python-specific:
- Object types: list, tuple, set, dictionary
- List comprehension
  Not that important, since it can always be replaced by loop, but it is a powerful Python feature. It is more succinct and faster than loop. On the other hand, it can be too terse that readability suffers.
- Packages: sys, sys.argv, os, math, numpy, pandas, matplotlib
- Pick a good text editor/IDE (I use vi, Jupyter notebook, and VSCode)
- So far don’t need: assertion, try, exception, classes, decorator

Finally, here are some Python courses/tutorials I have tried:

edX | Introduction to Computer Science and Programming Using Python
Comprehensive. Try this one if you need a solid foundation. Not suitable if you just want to get started quick.
edX | Python for Data Science
Introduces a lot of nice tools like Jupyter notebook and pandas.
Automate the Boring Stuff with Python
Practical stuff. Good for beginners who just want to pick up Python quickly.
DataCamp
Bite-size step-by-step lessons. Good if you have only small chunks of time here and there. You will also be given a small exercise right after each lesson to make it stick.
Software Carpentry tutorials, also for UNIX shell and git
How to think like a computer scientist
This one is my favourite. Things started to click for me here – perhaps because I’ve done the other Python tutorials at this point, but give it a try.

Share on

Twitter Facebook Google+ LinkedIn

Your email address will not be published. Required fields are marked *

Link roundup: 2024

1 minute read

Published: December 31, 2024

Science
Chaos and cause

To fully appreciate what this means, heed a lesson from Fyodor Dostoyevsky’s great novel, The Brothers Karamazov (1880), which asks how a benevolent God could allow suffering. There is just one virtuous character in the novel, the monk Father Zosima, whose simple teaching, dictated through the genius of Dostoyevsky, sheds light on chaos, causation and difference-making:
See, here you have passed by a small child, passed by in anger, with a foul word, with a wrathful soul; you perhaps did not notice the child, but he saw you, and your unsightly and impious image has remained in his defenceless heart. You did not know it, but you may thereby have planted a bad seed in him, and it may grow, and all because you did not restrain yourself before the child, because you did not nurture in yourself a heedful, active love … for one ought to love not for a chance moment but for all time. Anyone, even a wicked man, can love by chance. My young brother asked forgiveness of the birds: it seems senseless, yet it is right, for all is like an ocean, all flows and connects; touch it in one place and it echoes at the other end of the world.

Link roundup: Jan–Jun 2023

less than 1 minute read

Published: June 30, 2023

Science
Go Ahead, Try to Explain Milk
Killer Heat Waves Are Coming
Science Shows Why Traditional Kimchi Making Works So Well
A New Approach to Computation Reimagines Artificial Intelligence
The Computer Scientist Peering Inside AI’s Black Boxes

Others
The Case Against Travel
How to Keep Life from Becoming a Parody of Itself: Simone de Beauvoir on the Art of Growing Older
Is Wine Fake?
The Sound of Home: Sonorous Desert by Kim Haines-Eitzen
The Meaning of Life
The Dao of Using Your Smartphone
Camus’s Atheism and the Virtues of Inconsistency
Fatal Distraction: Forgetting a Child in the Backseat of a Car Is a Horrifying Mistake. Is It a Crime?

Link roundup: Apr–Dec 2022

1 minute read

Published: December 31, 2022

Science
Wood spirits: How Japan made the world’s first liquor from trees
The price of ‘sugar free’: are sweeteners as harmless as we thought?
A language model beats alphafold2 on orphans
https://github.com/FellowsFreiesWissen/computational_notebooks Why Conventional Wisdom About Cancer Can Be Misleading
Machine Learning to Handle the Proteome
‘The entire protein universe’: AI predicts shape of nearly every known protein
Could machine learning fuel a reproducibility crisis in science?
Blots on a field?
PNAS | Leveraging nonstructural data to predict structures and affinities of protein–ligand complexes
Breaking into the black box of artificial intelligence

Others
When a Houseplant Obsession Becomes a Nightmare
Book Review: What We Owe The Future
If Someone Is Typing, Then Stops … Can I Ask Why?

Link roundup: Jan–Mar 2022

2 minute read

Published: March 30, 2022

Since these AIs are just giant matrix multiplication machines, “intuition” now has a firm grounding in math - just much bigger, more complicated math than the usual kind that we call “logical”.

This would be a common pattern for sciences: much worse at everyday tasks than people who do them intuitively, until it generates some surprising and powerful new technology. Democritus figured out what matter was made of in 400 BC, and it didn’t help a single person do a single useful thing with matter for the next 2000 years of followup research, and then you got the atomic bomb (I may be skipping over all of chemistry, sorry).
– What Are We Arguing About When We Argue About Rationality?

What he seeks to practice is, in a phrase popularized by the Marxist philosopher Antonio Gramsci, “pessimism of the intellect, optimism of the will.”
– Can Science Fiction Wake Us Up to Our Climate Reality?

Caulfield then introduced two different ways of thinking about how we engage with ideas when we’re on the internet: The web as a garden and the web as a stream. Think of the web as an organically developing garden: a space in which there’s no predetermined order or relationship of things to one another. Caulfield writes, “Every walk through the garden creates new paths, new meanings.” What came first in the garden doesn’t matter either. Each thing in the garden is related to the other things as it exists in the moment.
– The Faithful Gardener

Science
Dual use of artificial-intelligence-powered drug discovery
Twelve quick tips for software design
Computer Scientists Prove Why Bigger Neural Networks Do Better
Failing the test: DNA barcoding brought botanist Steven Newmaster scientific fame and entrepreneurial success. Was it all based on fraud?
What’s the buzz? Let’s talk about numbing ingredients
The pandemic’s true death toll: millions more than official counts
5 nutrition goals that are better than weight loss

Others
https://github.com/csinva/imodels
Synaesthetics
Transformative Experience and Pascal’s Wager
Do Good Doorbell Cams Make Good Neighbors?
How to Want Less
It’s Your Friends Who Break Your Heart
How to be useless
What We Don’t Want to Know
It’s Time for Some Game Theory
Why does woman have ‘man’ in it and female has the word ‘male’ in it?

Yossa Dwi Hartono