
I'm an immunologist, microbiologist and computational biologist. I research cheese microbes, I'm creating an online biochemistry course and I have a podcast about the immune system.


Jass at the Lake

Converting the in-laws


Women in Biology, Computer Science and Computational Biology


My colleague Melanie Stefan and I have just submitted a manuscript for review (you can see the preprint here) about the representation of women as authors on scholarly publications. 

The bare outline of our procedure is not revolutionary:

  1. We downloaded the information for over 200,000 papers indexed in PubMed
  2. We computationally inferred the gender of the authors based on their first names (this process is somewhat complicated - you can find the code and some more explanation here)
  3. We analyzed the results, splitting the data a couple of different ways, bootstrapping statistics, etc.
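
For anyone curious what "bootstrapping statistics" looks like in practice, here is a minimal sketch of a percentile bootstrap for the proportion of female authors. It is only an illustration, not the code from the manuscript: the function name, the "F"/"M"/None labels and the default parameters are all assumptions made for this example.

```python
import random


def bootstrap_proportion_ci(labels, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the share of "F" labels.

    `labels` is a list of inferred genders: "F", "M", or None when the
    name could not be classified (a hypothetical encoding for this sketch).
    Unclassified authors are dropped before resampling.
    """
    known = [g for g in labels if g in ("F", "M")]
    if not known:
        raise ValueError("no authors with an inferred gender")

    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(known)

    # Resample the authors with replacement and recompute the proportion.
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(known, k=n)
        estimates.append(sample.count("F") / n)
    estimates.sort()

    # Read the interval off the empirical percentiles of the resamples.
    lo = estimates[int((alpha / 2) * n_resamples)]
    hi = estimates[min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))]
    point = known.count("F") / n
    return point, lo, hi
```

Calling `bootstrap_proportion_ci(["F", "M", "M", None, "F"])` returns the observed proportion along with a lower and upper bound. With real data you would run it separately for each author position or field you want to compare.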

No surprise, women are underrepresented in computational biology, like they are everywhere else:

 Figure 1A - proportion of female authorship by author position.

There are a couple of points that I think are particularly interesting, though. The first is that if the senior author of a paper is female, women are much better represented at all other author positions. Computational biology is still worse than biology as a whole, but female representation in biology jumps to nearly 50%, and in computational biology to 40%.

Paradoxically, I think that the most encouraging news comes from a graph that shows the lowest female representation. PubMed data only allowed us to compare biology and computational biology, but what about computer science? For this, we turned to the arXiv - a preprint server for quantitative fields. We can't really compare this directly to the data from PubMed, but the arXiv does have a "quantitative biology" section.

There, quantitative biology has better representation than computer science. It's still abysmal, don't get me wrong, but it suggests that maybe, just maybe, biology might be used as an inroad to get more women into computational and quantitative techniques.

This gets at the question I'm most interested in - we know representation is bad, but is there a way to improve it? These data aren't conclusive by any means, but they suggest there's a reason to try.

Now I just need to get a job where someone will let me experiment on (with?) undergrads...


Submitted! Gender disparity in computational biology research publications | bioRxiv


Thank yous from Emmanuel College

Just got a thank-you card from the lovely students in Prof. Deighan's microbiome course.


Hopefully wrapping up HGT paper in the next week or so, excited to get started on the next thing:


NeuroLogica Blog » Framing the Debate on GMOs

Great discussion about the ways in which framing alters the way arguments are made and received.