computing – Page 2

Pre Self: what fraction of a journal’s papers are preprinted?

Answering the question of what fraction of a journal’s papers were previously available as a preprint is quite difficult to do. The tricky part is matching preprints (from a number of different servers) with the published output from a journal. The easy matches are those that are directly linked together, the remainder though can be […]

9th March 2024By Stephen Royle computing, publishing, science Crossref, PubMed, Rstats

Tips from the Blog XVI: getting FASTA sequences

I am having some fun running AlphaPulldown on a computing cluster. A requirement is to have input sequences in FASTA format. I found that I needed to get ~600 sequences. I had a list of the relevant Uniprot IDs. Surely getting the sequences for these proteins should be straightforward? Solution The Uniprot IDs can be […]

4th January 2024By Stephen Royle adventures in code, computing, science bioinformatics, tftb

Airy Area: approximating surface area of a cell from a 3D point set

In the spirit of “if it took you a while to find out how to do something, write about it”, I will detail a method to approximate the surface area of a 3D shape. Our application here was finding the surface area of a cell but it can be used on any shape. We start […]

21st November 2023By Stephen Royle computing alphashape3d, Rstats, statisticsOne Comment

All The Right Friends II: clustering papers using Google Scholar data

In a previous post, I looked at how Google Scholar ranks co-authors. While I had the data available I wondered whether paper authorship could be used in other ways. A few months back, John Cook posted about using Jaccard index and jazz albums. The idea is to look at the players on two jazz albums […]

29th October 2023By Stephen Royle computing, fun, publishing citations, Google Scholar, metrics, Rstats

Probot 2: upgrading a Mastodon bot

Earlier this year I set up a bot on Mastodon. The bot, AlbumsX3, posts an album suggestion twice-a-day. Performance has been good. It has only missed a few posts due – I think – to server glitches. However, I have made a couple of tweaks to upgrade the bot since my last post, so I […]

6th August 2023By Stephen Royle computing, fun mastodon, music, python

Mr. Mastodon Farm: analysing a mastodon ActivityPub outbox.json file

I migrated my personal Mastodon account from mastodon.social to biologists.social recently. If you’d like to do the same, I found this guide very useful. Note that, once you move, all your previous posts are left behind on the old instance. Before I migrated, I downloaded all of my data from the old instance. I thought […]

29th July 2023By Stephen Royle computing, fun json, mastodon, Rstats4 Comments

Free Bird II: Mastodon macOS clients

This is a brief review of macOS Mastodon clients that I’ve tried. It is unashamedly incomplete/non-exhaustive, but since the ones I found online from computing magazines literally look at one app, I am ahead of the pack here! tl;dr I prefer Ivory on macOS and prior to that, Mastonaut was OK. For clarity: I have […]

20th June 2023By Stephen Royle computing, fun ebou, fediverse, ivory, mastodon, mastonaut, whalebird

Step By Step: recreating a volcano plot in R

We have an analysis routine for proteomics data written for IgorPro. One output is a volcano plot. These plots show the fold change in one sample compared to another and plot that against a p-value to estimate how reproducible any changes observed are. This post is not about that software, but on the topic of […]

16th June 2023By Stephen Royle computing dataviz, ggplot, IgorPro, proteomics, Rstats, VolcanoPlot3 Comments

Pledging My Time VI: scraping and analysis of race results in R

I’ve posted in the past about analysing race results in R (most recently here). I ran the 2023 MK Marathon and wanted to have a look at the finishing times. The days of race results being made available as a csv or xls for easy analysis seem to be behind us. Instead they tend to […]

8th May 2023By Stephen Royle computing, fun ggplot, marathon, Rstats, running, rvest2 Comments

Yet Another Movie: IMDB Top 250 movies

I’m not a big movie person. Nonetheless I have a media library with quite a few films in and I wondered how many “films to see before you die”-type movies I had in the collection, and how many were missing. I used R to find the answers. I’ve described previously how to get a plain […]

29th April 2023By Stephen Royle computing, fun plex, Rstats