Bioinformatics Tool Chest: R Programming Language

Posted by: Eli Roberson  :  Category: General, Technical

Data

Scientists love data. Call it a character flaw, but most of us can’t get enough. More data, more! But the data alone are just the start. To really be useful, we have to do something with the data. Model. Summarize. Evangelize it. Something. Who hasn’t needed to plot a standard curve? Or find the mean value of a series of numbers? What should you do when you have these questions.

The Problem

Many scientists turn to our friend Excel to solve these problems. It’s easy to work with, and you can even make graphs easily. That isn’t necessarily a good thing, as perfectly nice people make really bad graphs because those fancy 3D features are so tantalizing. Everyone interested in bioinformatics or computational biology needs a tool in their tool chest that can handle:

  1. statistics
  2. figure, graph creation
  3. very large data

The Solution

Look no further friends, your savior has arrive, and its name is R. R is a free, cross-platform, open-source derivitive of the S language. In case you didn’t catch that last part: R is free. You can download R from the nearest mirror to get started.

The Good

  • Freely available
  • Open-source — can compile it to your needs (OS, cpu, available memory, optimization levels)
  • Tons of add on packages
  • Scriptable
  • Ability to write own functions and packages
  • Able to handle large datasets
  • Interfaces with compiled languages
  • Can save plots as Post-scripts (print quality)
  • Extensive tutorials online along with mailing lists and archives for trouble shooting

The Bad

  • Command-line interface
  • Can be slow reading large files
  • Interpreted language (can be slower than compiled code)
  • No tech support line
  • Steep learning curve for beginners, especially non-programmers

5 Responses to “Bioinformatics Tool Chest: R Programming Language”

  1. Abbas Says:

    Hi,

    Do you consider R programming more useful than Perl for the average Bioinformaticist?

    -Abbas

  2. Eli Roberson Says:

    For anything that involves stats or figure generation I use R because (1) there are so many packages that already know how to perform statistical tests (2) its easy for me to make my own functions and (3) I can generate postscripts exactly in the form I want using R.

    But the bottom-line is that it depends entirely on what you want to do. If I’m doing text file manipulation, I’ll probably use Perl. But if I’m needing to perform an ANOVA on a set of data, I’ll go with R.

  3. The Bioinformatics Blog » Blog Archive » Bioinformatics Tool Chest: Bioconductor Says:

    [...] up on the previous bioinformatics tool chest post, I thought I’d cover Bioconductor next. Bioconductor is [...]

  4. The Bioinformatics Blog » Blog Archive » The New York Times Nods to R Says:

    [...] a previous Bioinformatics Toolchest post I discussed R, a statistical programming software that I’m a [...]

  5. FREDRICK Says:


    CheapTabletsOnline.com. Canadian Health&Care.Special Internet Prices.No prescription online pharmacy.Best quality drugs. High quality drugs. Order pills online

    Buy:Zetia.Benicar.Prozac.Ventolin.Lipothin.Seroquel.SleepWell.Acomplia.Lipitor.Female Cialis.Buspar.Amoxicillin.Wellbutrin SR.Aricept.Lasix.Advair.Female Pink Viagra.Zocor.Nymphomax.Cozaar….

Leave a Reply