On a scientist’s Google Scholar page, there is a list of co-authors in the sidebar. I’ve often wondered how Google determines in what order these co-authors appear. The list of co-authors on a primary author’s page is not exhaustive. It only lists co-authors who also have a Google Scholar profile. They also have to be […]
Tag: Rstats
Mr. Mastodon Farm: analysing a mastodon ActivityPub outbox.json file
I migrated my personal Mastodon account from mastodon.social to biologists.social recently. If you’d like to do the same, I found this guide very useful. Note that, once you move, all your previous posts are left behind on the old instance. Before I migrated, I downloaded all of my data from the old instance. I thought […]
Step By Step: recreating a volcano plot in R
We have an analysis routine for proteomics data written for IgorPro. One output is a volcano plot. These plots show the fold change in one sample compared to another and plot that against a p-value to estimate how reproducible any changes observed are. This post is not about that software, but on the topic of […]
Pledging My Time VI: scraping and analysis of race results in R
I’ve posted in the past about analysing race results in R (most recently here). I ran the 2023 MK Marathon and wanted to have a look at the finishing times. The days of race results being made available as a csv or xls for easy analysis seem to be behind us. Instead they tend to […]
Yet Another Movie: IMDB Top 250 movies
I’m not a big movie person. Nonetheless I have a media library with quite a few films in and I wondered how many “films to see before you die”-type movies I had in the collection, and how many were missing. I used R to find the answers. I’ve described previously how to get a plain […]
Probot: building a Mastodon bot
I have long admired albums2hear, a Twitter bot that posts albums. You can read a bit more about it here. There was no mastodon equivalent and so I decided to build one. You can follow the bot – currently called Albums Albums Albums (or AlbumsX3) – here. Idea behind the bot The idea is to […]
Running Around: 2022 running dataviz in R
2022 was my best year for running to date. In 2021, my goal was to run 2021 km. For 2022, I wanted to see if I could run 2500 km and also to run 50 HM-or-more distance runs. I managed both and ended the year on a total of 2734 km. I also bagged two […]
Twitcher II: tweet frequency and top tweets
Another post looking at Twitter data in R. It follows this one and this one. I wanted to look again at my tweeting frequency over the 12 years on Twitter, but this time do it in a calendar view. Something like a GitHub commit calendar would be perfect. I have used a library for this […]
Any Time At All: tweet frequency around the clock
Please consider this a “supplementary analysis” to my previous post looking at the frequency of tweets from my personal account over the last 12 years. I was curious about what times I was active on Twitter (measured by when I tweeted). Others might be interested in a solution to look at this in R. The […]
Twitcher: tweet frequency over the years
At the time of writing, I have essentially left Twitter. It was a fun ride and without going into what’s happening there now, this is a good opportunity to look at my 12 years on the platform. Early in November, I downloaded my data and locked my Twitter account. This gave me all the data […]