We have an analysis routine for proteomics data written for IgorPro. One output is a volcano plot. These plots show the fold change in one sample compared to another and plot that against a p-value to estimate how reproducible any changes observed are. This post is not about that software, but on the topic of […]
Tag: ggplot
Pledging My Time VI: scraping and analysis of race results in R
I’ve posted in the past about analysing race results in R (most recently here). I ran the 2023 MK Marathon and wanted to have a look at the finishing times. The days of race results being made available as a csv or xls for easy analysis seem to be behind us. Instead they tend to […]
Running Around: 2022 running dataviz in R
2022 was my best year for running to date. In 2021, my goal was to run 2021 km. For 2022, I wanted to see if I could run 2500 km and also to run 50 HM-or-more distance runs. I managed both and ended the year on a total of 2734 km. I also bagged two […]
Twitcher II: tweet frequency and top tweets
Another post looking at Twitter data in R. It follows this one and this one. I wanted to look again at my tweeting frequency over the 12 years on Twitter, but this time do it in a calendar view. Something like a GitHub commit calendar would be perfect. I have used a library for this […]
Any Time At All: tweet frequency around the clock
Please consider this a “supplementary analysis” to my previous post looking at the frequency of tweets from my personal account over the last 12 years. I was curious about what times I was active on Twitter (measured by when I tweeted). Others might be interested in a solution to look at this in R. The […]
Twitcher: tweet frequency over the years
At the time of writing, I have essentially left Twitter. It was a fun ride and without going into what’s happening there now, this is a good opportunity to look at my 12 years on the platform. Early in November, I downloaded my data and locked my Twitter account. This gave me all the data […]
Get Miles: using treemap to visualise running distances
By 30th September 2022, I had clocked up a total of over 2000 km of running in 2022. This milestone was a good opportunity to look at how I got to this point. The code is shown below. First, we can make a histogram to look at the distance of runs. From this type of […]
Pledging My Time V: analysing race results in R
It’s been a while since I posted a breakdown of half marathon times. The last time seems to have been 2018. I decided to give my old code a clean-up and quickly crunched the numbers from the 2022 Kenilworth Half Marathon. First, the results: Briefly, the code below reads in a csv file of race […]
Os Mutantes: an implausible naming system for SARS-CoV-2 mutants
The scientific response to the COVID-19 pandemic has been astounding. Aside from efforts to generate vaccines, the genomic surveillance of the virus has been truly remarkable. For example, the nextstrain project has sequence many SARS-CoV-2 genomes. In fact, the rapid identification of multiple new strains and mutations by diverse groups of scientists has resulted in […]
Represent: geographical breakdown of a virtual seminar series
During the pandemic, many virtual seminar programmes have popped up. One series, “Motors in Quarantine“, has been very successful. It’s organised by my colleagues Anne Straube, Alex Zwetsloot and Huong Vu. Anne wanted to know if attendees of the seminar series were a fair representation of the field. We know the geographical location of the […]