Welcome to the JABBA menagerie: a collection of animal-themed, bogus bioinformatics names…that have nothing to do with animals!

October 23, 2015 by Keith Bradnam

Bioinformaticians make the worst zookeepers:

A

2011 — ANTELOPE: Analysis of Networks through TEmporal-LOgic sPEcifications …has nothing to do with antelopes

B

2014 — BISON: BISulfite alignment On Nodes of a cluster …has nothing to do with bisons

C

2011 — CORAL: CORrection with ALignments …has nothing to do with corals

D

2010 — DODO: DOmain based Detection of Orthologs …has nothing to do with dodos

E

2011 — EMU: Extractor of MUtations …has nothing to do with emus
2014 — EAGLE: Enhanced Artificial Genome Engine (no 'L'?!?) …has nothing to do with eagles

F

2014 — FALCON: FAst Localization algorithm based on a CONtinuous-space formulation …has nothing to do with falcons
2015 — FROG: FingeRprinting Ontology of Genomic variations …has nothing to do with frogs
2017 — FROGS: Find, Rapidly, OTUs with Galaxy Solution …also has nothing to do with frogs

G

2015 — GECKO: GEnome Comparison with K-mers Out-of-core …has nothing to do with geckos
2009 — GORILLA: Gene Ontology enRIchment anaLysis and visuaLizAtion …has nothing to do with gorillas
2018 — GRASShopPER: GPU overlap GRaph ASSembler using Paired End Reads …has nothing to do with grasshoppers

H

2010 — HAMSTeRS: Haemophilia A Mutation, Structure, Test, and Resource Site …has nothing to do with hamsters

I

2013 — INSECT: IN-silico SEarch for Co-occurring Transcription factors …has nothing to do with insects

J

2014 — JAGuaR: Junction Alignments to Genome for RNA-seq reads …has nothing to do with jaguars

K

L

M

2002 — MOUSE: Mitochondrial and Other Useful SEquences …has nothing to do with mice
2014 — MONGOOSE: MetabOlic Network GrOwth Optimization Solved Exactly …has nothing to do with mongooses

N

O

2013 — ORCA: mOdel-dRiven disCovery and Analysis …has nothing to do with orcas

P

2015 — PANDA: Pathway AND Annotation explorer …has nothing to do with pandas
2005 — PANTHER: Protein ANalysis THrough Evolutionary Relationships …has nothing to do with panthers
2014 — PIGEONS: Photographically InteGrated En-suite for the OligoNucleotide Screening …has nothing to do with pigeons
2014 — PuFFIN: Positioning for Fuzzy and FIxed Nucleosomes …has nothing to do with puffins

Q

R

2013 — RAVEN: Reconstruction, Analysis and Visualization of mEtabolic Networks …has nothing to do with ravens

S

2006 — SPIDer: Saccharomyces Protein-protein Interaction Database …has nothing to do with spiders
2009 — SHRiMP: SHort Read Mapping Package …has nothing to do with shrimps

T

2008 — TiGER: Tissue-specific Gene Expression and Regulation …has nothing to do with tigers

U

V

W

X

Y

Z

2004 — ZEBRA: Zebra finch Expression BRain Atlas …has nothing to do with zebras

Other suggestions welcome! The only requirements are that:

The name is bogus, i.e. not a straightforward acronym and worthy of a JABBA award
The acronym is named after an animal (or animal grouping)
The software/tool has nothing to do with the animal in question

Great Scott! Five fun facts about DNA sequencing from 1985

October 21, 2015 by Keith Bradnam

As everyone is celebrating a certain 2015-themed calendar event today, I thought we could instead go back to the ~~future~~ past of DNA sequencing.

1.

Thirty years ago there were no automated sequencing machines. However, Sanger sequencing technology could still provide longer reads than most of Illumina's machines today, e.g. from this paper (A rapid procedure for DNA sequencing using transposon-promoted deletions in Escherichia coli):

The length of the sequence that could be read from each gel in a single run varied from 175 to 200 nt.

2.

The idea of sequencing nuclear genomes was still largely a pipe dream, but smaller genomes were tractable. 1985 saw the addition of the Xenopus laevis mitochondrial genome to the tiny collection of organelle genome sequences. Figure 3 of this paper displayed the full sequence, spread over six pages that looked like this:

Including long DNA sequences in journal articles was a surprisingly common practice at this time.

3.

There were two releases of GenBank in 1985. The second release saw the database grow to an astounding 5,700 sequences, totalling 5,204,420 bp. For comparison, 1985 also saw the release of the Commodore 128 home computer, which came with 128 KB of RAM. The first 3.5" hard drives were only a couple of years old and could store 10 MB (so capable of storing the DNA sequences in GenBank, but possibly not the associated annotation).
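To put those numbers side by side, here is a rough back-of-the-envelope sketch. It assumes one byte per base for plain text, decimal megabytes for the drive, and ignores sequence headers and annotation entirely; the function and constant names are just for illustration.

```python
GENBANK_BP = 5_204_420            # total bases in the second 1985 release (from the text)
C128_RAM_BYTES = 128 * 1024       # 128 KB of RAM
DRIVE_BYTES = 10 * 1000 * 1000    # 10 MB drive, assuming decimal megabytes


def storage_bytes(n_bases, bits_per_base):
    """Bytes needed to hold n_bases at a given bits-per-base, rounded up."""
    return (n_bases * bits_per_base + 7) // 8


as_text = storage_bytes(GENBANK_BP, 8)    # one ASCII character per base
packed = storage_bytes(GENBANK_BP, 2)     # A/C/G/T squeezed into 2 bits each

print(f"plain text: {as_text:,} bytes ({as_text / DRIVE_BYTES:.0%} of the drive)")
print(f"2-bit packed: {packed:,} bytes")
print(f"Commodore 128s' worth of RAM for plain text: {as_text / C128_RAM_BYTES:.1f}")
```

Under these assumptions the raw sequence takes up roughly half of a 10 MB drive, which fits the guess that the annotation might not have squeezed in alongside it.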

4.

The SEQ-ED program was published, allowing the handling of 'long DNA sequences' that were 'up to 200 Kbp'.

5.

Somewhat amazingly, people were writing bioinformatics software for Apple computers. The journal CABIOS included this paper:

PEGASE: a machine language program for DNA sequence analysis on Apple II microcomputer using a binary coding of nucleotides

But how did people distribute software in the days when there was no GitHub, SourceForge, or indeed…no world wide web?

For both code and source of PEGASE, please send two blank 5" diskettes and indicate precisely your system configuration (there is a slight difference between the Apple II+ and the Apple IIe version which depends on the availability of lower case characters).
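As an aside on the 'binary coding of nucleotides' in PEGASE's title: the post does not describe the scheme the program actually used, so the following is only a generic sketch of the idea, written in Python rather than Apple II machine language. It assumes a plain A/C/G/T alphabet with two bits per base, and the `pack` and `unpack` names are hypothetical.

```python
# Assumed mapping; PEGASE's real encoding is not given in the post.
CODES = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASES = "ACGT"


def pack(seq):
    """Pack a DNA string into bytes, four bases per byte, first base in the high bits."""
    out = bytearray((len(seq) + 3) // 4)
    for i, base in enumerate(seq.upper()):
        out[i // 4] |= CODES[base] << (6 - 2 * (i % 4))
    return bytes(out)


def unpack(data, length):
    """Recover the first `length` bases from packed bytes."""
    return "".join(
        BASES[(data[i // 4] >> (6 - 2 * (i % 4))) & 0b11] for i in range(length)
    )


packed = pack("GATTACA")
print(len(packed), unpack(packed, 7))
```

The sequence length has to be stored separately, because the last byte may be only partly filled. On a machine with tens of kilobytes of memory, a fourfold saving over one character per base would have mattered a great deal.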

BINGO, DINGO, PINGO, RINGO, and SPINGO

October 21, 2015 by Keith Bradnam

Sounds like these should be characters in a children's TV show.

Dovetail takes flight

October 21, 2015 by Keith Bradnam

If you ever want to know about the latest developments in sequencing, you owe it to yourself to follow Keith Robison's blog. In his latest post he talks about the launch of the new de novo assembly service from Dovetail Genomics. Keith concludes:

Personally, a pure service offering is very attractive, since that means not having to find internal resources to learn the new technology and then execute on it. I checked with Dovetail, and while I don't have $40K burning a hole in my pocket, if I did I could grab something out of the garden or from the local seafood market, I really could have a complex genome scaffold of my very own in about two months. That's an exciting vision, and perhaps will be a major force in the sunsetting of science's tolerance for highly fragmented draft genomes.

Readers may also enjoy Bio-IT World's report on this new Dovetail service.