Microflaccid Office Fail – 40 Years In The Desert

It turns out that one of the major data exchange formats for genetics is Microsoft Excel, and we have now discovered that the Redmond company’s flagship spreadsheet program has been autocorrecting the data into oblivion:

For many people, working with error-ridden spreadsheets is a way of life. This takes on added meaning for genomics researchers, who study the building blocks of life. It turns out that their work, too, is rife with dodgy spreadsheets.

A new paper has revealed the vast extent of errors in published genomics research, which is down to an unfortunate quirk of Microsoft Excel. A trio of scientists in Australia scanned 7,500 Excel files with gene lists accompanying 3,600 papers in 18 journals over a 10-year period. One-fifth of the files had easily identified errors, which is “quite striking and a little bit embarrassing,” says Mark Ziemann of the Baker IDI medical research institute in Melbourne, one of the paper’s co-authors.

What happened? By default, Excel and other popular spreadsheet applications convert some gene symbols to dates and numbers. For example, instead of writing out “Membrane-Associated Ring Finger (C3HC4) 1, E3 Ubiquitin Protein Ligase,” researchers have dubbed the gene MARCH1. Excel converts this into a date—03/01/2016, say—because that’s probably what the majority of spreadsheet users mean when they type it into a cell. Similarly, gene identifiers like “2310009E13” are converted to exponential numbers (2.31E+19). In both cases, the conversions strip out valuable information about the genes in question.
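For anyone who wants to check a gene list before it goes anywhere near a spreadsheet, here is a minimal Python sketch of the two failure modes described above. The regular expressions and the `conversion_risk` helper are my own rough assumptions about which identifiers look like dates or scientific notation. They are not Excel's actual parsing rules, which cover far more cases.

```python
import re

# Hypothetical heuristics: these approximate the two conversions described
# above (month-like symbols and digits-E-digits identifiers). They are NOT
# a reimplementation of Excel's real type-guessing logic.
MONTHS = ("JAN", "FEB", "MAR", "MARCH", "APR", "MAY", "JUN",
          "JUL", "AUG", "SEP", "SEPT", "OCT", "NOV", "DEC")
DATE_LIKE = re.compile(r"^(?:%s)-?\d{1,2}$" % "|".join(MONTHS), re.IGNORECASE)
SCI_LIKE = re.compile(r"^\d+E\d+$", re.IGNORECASE)


def conversion_risk(identifier):
    """Return 'date', 'number', or None for a gene identifier string."""
    s = identifier.strip()
    if DATE_LIKE.match(s):
        return "date"      # e.g. MARCH1 reads as "March 1"
    if SCI_LIKE.match(s):
        return "number"    # e.g. 2310009E13 reads as 2310009 x 10^13
    return None


def as_scientific(identifier):
    """Show what a digits-E-digits identifier looks like once parsed as a float."""
    return "%.2E" % float(identifier)


if __name__ == "__main__":
    for gene in ["MARCH1", "SEPT2", "2310009E13", "TP53"]:
        print(gene, conversion_risk(gene))
    print(as_scientific("2310009E13"))
```

Once "2310009E13" is parsed as a floating-point number and displayed with two decimal places, the digits that made it a unique identifier are gone from view, which is the information loss the article describes.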

What on earth inspired all these researchers to use what can only be described as the greasy kid stuff of analysis and data storage for this purpose?

It’s nucking futz.


2 comments

29 August 2016, 10:30 PM

chiang01 says:

Clippy knew how to fix those types of errors. It never would have happened if he were on the job.

30 August 2016, 1:08 PM

Matthew Saroff says:

You win the internet today.
