As 2021 draws to a close, it’s time to check back in on a previous post. As a recap, Garmin Connect offered four challenges to run 3 x 505 km and 1 x 505 km, one in each of the four quarters of 2021. Completing all four would mean running 2021 km in 2021. I […]
Tag: Rstats
Under The Same Name: Journal titles and band names
Some journals sound like they should be bands. Whereas some journal titles ARE in fact the same as band names. I wondered… how many journal titles are also band names. Let’s find out! The journals cited in PubMed could be downloaded as a text file from here. This list includes every MEDLINE journal – even […]
Over and Over: Preprint revisions on bioRxiv
The aim of this post is to look at revisions of bioRxiv preprints. I’m interested how long preprint versions exist on bioRxiv. In other words: how long do revisions to preprints take? The data from bioRxiv is a complex dataset with many caveats as I’ll explain further down, but some interesting details do emerge. Consider […]
Ten Years vs The Spread II: Calculating publication lag times in R
This post is an update of a previous analysis on quantixed. We have covered publication lag times – the time it takes for a paper go from submitted (received) to accepted and published – a lot on here. It is possible to look at lag times, in R using data from PubMed. Previously my code […]
The Rise and Fall: impact of the COVID-19 pandemic on bioRxiv preprints
As the COVID-19 pandemic continues, different countries are experiencing various restrictions including lockdowns. Some of these restrictions alter our ability to do science: by hindering lab access or taking time away from researchers for homeschooling. So, what impact has the pandemic had on scientific output? One way to look at this – for biology – […]
Os Mutantes: an implausible naming system for SARS-CoV-2 mutants
The scientific response to the COVID-19 pandemic has been astounding. Aside from efforts to generate vaccines, the genomic surveillance of the virus has been truly remarkable. For example, the nextstrain project has sequence many SARS-CoV-2 genomes. In fact, the rapid identification of multiple new strains and mutations by diverse groups of scientists has resulted in […]
Target: monitoring a running goal in R
Many running races and events have been cancelled or disrupted due to the pandemic. To fill the void, I’ve been taking on Garmin’s “Challenges”. In Garmin Connect, you can accept a challenge set by Garmin (I think users can challenge each other too). Completion of these challenges gives the user points, which I’ve become somewhat […]
Color Bars
Here is a fun post about using colour palettes in R. It starts with a computer game… After a few years of sporadically playing Super Mario World 2 – Yoshi’s Island on the Retropie, I made it to the final level. In the background, as Bowser approached, I noticed that those coloured bars in the […]
Represent: geographical breakdown of a virtual seminar series
During the pandemic, many virtual seminar programmes have popped up. One series, “Motors in Quarantine“, has been very successful. It’s organised by my colleagues Anne Straube, Alex Zwetsloot and Huong Vu. Anne wanted to know if attendees of the seminar series were a fair representation of the field. We know the geographical location of the […]
Running Free: Calculating Efficiency Factor in R
Joe Friel reposted an article earlier this year on Efficiency Factor in running. Efficiency Factor (EF) can be viewed in Training Peaks software and he describes how it is calculated. This post describes how I went about calculating EF in R using a single gpx file. What is Efficiency Factor (EF)? Essentially, EF is the […]