Plotting business: Automated linear regression with Grace

5 minute read

Published: February 20, 2019

What software do you use to plot?

Like most people my first experience was with Excel. I remember the settings were quite confusing and not intuitive (this was Excel 2000/2003, no idea about current version). Then there are commercial softwares like Origin, which I used for here and there. It is quite user-friendly, but 1) I try to avoid pricy software packages when I can, and 2) commercial packages usually only run on Windows and I haven’t had main Windows workstation in years. My experimental colleagues use mostly GraphPad, and the graphs look nice, but I never used it for the same reasons as Origin.

A comment about Google Sheets: one would expect it will be similar to Excel, but I’m pleasantly surprised that it is easy to use and yields pretty graphs. I use this for simple charts that do not need heavy customisation/annotations, e.g. histogram.

Most recently I have learned to use the myriad of Python packages: matplotlib, holoviews and the like. They are indeed powerful, pretty, and customisability is certainly there as far as your Python fu allows. But the learning curve is pretty high. Even when generating simple plots, I have to look up stuff again, even though I already have a previous script as template.

So, I found myself keep going back to Grace, an ancient(?) and arcane plotting software I first learned in undergraduate days when I first encountered a Linux machine. Ok I always thought it’s ancient (like 70s-80s) because of how the GUI looks, but it was released in 1990s. Anyway, its graphs have a sort of signature look to it, and one can still see it in journal papers, especially molecular dynamics papers.

I can think of a few reasons why Grace sticks. First, you can get a plot very quickly from a plain-text file. In matplotlib, you have to import the data first into pandas dataframe and whatnot. Second, the plot file itself is a plain-text file containing the parameters and the data. Plain text means that it can be modified with the bash arsenal of text manipulation. Arguably, matplotlib is the same in this regard but it is a little different. The matplotlib .py script would be the instructions to construct the graph, while the Grace .agr file have the parameters and their values there (although it’s also possible to to save Grace instruction file – this is called the batch file). For holoviews for example, I found myself looking up for the correct commands/instructions to set some parameters, while in Grace I can just go to to the parameter in question and change its value.

Here I will demonstrate how I use Grace and some tips and tricks. Suppose you have data.dat that you want to plot:

# if you don't have some data handy, use this random number generator
# generates 10 numbers less than 100
for i in {1..10}; do echo $i $(expr $RANDOM % 100); done > data.dat

# cat data.dat
11
3
29
87
15
38
6
55
7
32

Simply call Grace to plot it:

# Grace with GUI
xmgrace data.dat

And you will see a meh-looking plot. But the power of Grace is in the script automation. Save this plot as data.agr. Here is fit.par which will do linear regression and plot the regression plot:

# cat fit.par
with g0
view ymin 0.45
view ymax 0.85
s0 symbol 3
s0 symbol size 0.400000
s0 line type 0
# regression formula
fit formula "y = a0 + a1*x"
fit with 2 parameters
fit prec 1e-5
# run regression, 100 iterations
nonlfit(s0,100)
# duplicate data from original data (s0) to s1
copy s0 to s1
s1 symbol 0
s1 line type 1
s1 line color 7
s1 type xy
# overwrite y to make regression line
s1.y = a0 + a1*x
autoscale

Apply fit.par:

# Grace with command-line interface
grace data.agr -param fit.par -saveall data.fit.agr -hardcopy -noprint > data.fit.log

You will be glad when you have 15 plots to do linear regression on like I do. I guess you can do the clicking around on the GUI 15 times but what will you do if you have 100 plots?

One more thing I needed was to put the R² values in the graphs. I did this in a very roundabout way (please tell me if you have a more elegant solution):

Annotate each of the 15 graphs (I have put them in one file, plot.agr) with textboxes containing the strings “corr1”, …, “corr15”
Extract R-value from data.fit.log and calculate R² ```bash \rm correlation.dat for i in {1..15}; do grep -H Correlation data$i.fit.log » correlation.dat done awk ‘{printf %.2f\n”, $3^2}’ correlation.dat > corr_squared.dat
Make a dictionary corr.dict such that ‘corr1’ corresponds to first value of R², and so on
```
# cat corr.dict
corr1 0.71
...
corr15 0.80
```

Replace plot.agr (the 15 graphs) consulting the dictionary (remember what I said about plain-text .agr being amenable to text manipulation?):

awk 'NR == FNR {
  rep[$1] = $2
  next
} 
{
  for (key in rep)
    gsub(key, rep[key])
  print
}' corr.dict plot.agr > plot_check.agr

Grace is powerful, but unfortunately the documentation, especially the scripting part, is a bit sparse. Hopefully that will improve!

Share on

Twitter Facebook Google+ LinkedIn

Your email address will not be published. Required fields are marked *

Link roundup: 2024

1 minute read

Published: December 31, 2024

Science
Chaos and cause

To fully appreciate what this means, heed a lesson from Fyodor Dostoyevsky’s great novel, The Brothers Karamazov (1880), which asks how a benevolent God could allow suffering. There is just one virtuous character in the novel, the monk Father Zosima, whose simple teaching, dictated through the genius of Dostoyevsky, sheds light on chaos, causation and difference-making:
See, here you have passed by a small child, passed by in anger, with a foul word, with a wrathful soul; you perhaps did not notice the child, but he saw you, and your unsightly and impious image has remained in his defenceless heart. You did not know it, but you may thereby have planted a bad seed in him, and it may grow, and all because you did not restrain yourself before the child, because you did not nurture in yourself a heedful, active love … for one ought to love not for a chance moment but for all time. Anyone, even a wicked man, can love by chance. My young brother asked forgiveness of the birds: it seems senseless, yet it is right, for all is like an ocean, all flows and connects; touch it in one place and it echoes at the other end of the world.

Link roundup: Jan–Jun 2023

less than 1 minute read

Published: June 30, 2023

Science
Go Ahead, Try to Explain Milk
Killer Heat Waves Are Coming
Science Shows Why Traditional Kimchi Making Works So Well
A New Approach to Computation Reimagines Artificial Intelligence
The Computer Scientist Peering Inside AI’s Black Boxes

Others
The Case Against Travel
How to Keep Life from Becoming a Parody of Itself: Simone de Beauvoir on the Art of Growing Older
Is Wine Fake?
The Sound of Home: Sonorous Desert by Kim Haines-Eitzen
The Meaning of Life
The Dao of Using Your Smartphone
Camus’s Atheism and the Virtues of Inconsistency
Fatal Distraction: Forgetting a Child in the Backseat of a Car Is a Horrifying Mistake. Is It a Crime?

Link roundup: Apr–Dec 2022

1 minute read

Published: December 31, 2022

Science
Wood spirits: How Japan made the world’s first liquor from trees
The price of ‘sugar free’: are sweeteners as harmless as we thought?
A language model beats alphafold2 on orphans
https://github.com/FellowsFreiesWissen/computational_notebooks Why Conventional Wisdom About Cancer Can Be Misleading
Machine Learning to Handle the Proteome
‘The entire protein universe’: AI predicts shape of nearly every known protein
Could machine learning fuel a reproducibility crisis in science?
Blots on a field?
PNAS | Leveraging nonstructural data to predict structures and affinities of protein–ligand complexes
Breaking into the black box of artificial intelligence

Others
When a Houseplant Obsession Becomes a Nightmare
Book Review: What We Owe The Future
If Someone Is Typing, Then Stops … Can I Ask Why?

Link roundup: Jan–Mar 2022

2 minute read

Published: March 30, 2022

Since these AIs are just giant matrix multiplication machines, “intuition” now has a firm grounding in math - just much bigger, more complicated math than the usual kind that we call “logical”.

This would be a common pattern for sciences: much worse at everyday tasks than people who do them intuitively, until it generates some surprising and powerful new technology. Democritus figured out what matter was made of in 400 BC, and it didn’t help a single person do a single useful thing with matter for the next 2000 years of followup research, and then you got the atomic bomb (I may be skipping over all of chemistry, sorry).
– What Are We Arguing About When We Argue About Rationality?

What he seeks to practice is, in a phrase popularized by the Marxist philosopher Antonio Gramsci, “pessimism of the intellect, optimism of the will.”
– Can Science Fiction Wake Us Up to Our Climate Reality?

Caulfield then introduced two different ways of thinking about how we engage with ideas when we’re on the internet: The web as a garden and the web as a stream. Think of the web as an organically developing garden: a space in which there’s no predetermined order or relationship of things to one another. Caulfield writes, “Every walk through the garden creates new paths, new meanings.” What came first in the garden doesn’t matter either. Each thing in the garden is related to the other things as it exists in the moment.
– The Faithful Gardener

Science
Dual use of artificial-intelligence-powered drug discovery
Twelve quick tips for software design
Computer Scientists Prove Why Bigger Neural Networks Do Better
Failing the test: DNA barcoding brought botanist Steven Newmaster scientific fame and entrepreneurial success. Was it all based on fraud?
What’s the buzz? Let’s talk about numbing ingredients
The pandemic’s true death toll: millions more than official counts
5 nutrition goals that are better than weight loss

Others
https://github.com/csinva/imodels
Synaesthetics
Transformative Experience and Pascal’s Wager
Do Good Doorbell Cams Make Good Neighbors?
How to Want Less
It’s Your Friends Who Break Your Heart
How to be useless
What We Don’t Want to Know
It’s Time for Some Game Theory
Why does woman have ‘man’ in it and female has the word ‘male’ in it?

Yossa Dwi Hartono