Bioinformatics can offer efficient, automated, accelerated analysis of genetic data
Many biorefining companies engineer more than equipment and mechanical processes in technology development. They also engineer, design or breed microorganisms, whether algae, yeast or bacteria. To design the most efficient, cost-effective process technology possible, these companies must ensure that the microorganisms they employ are highly effective at performing whatever conversion necessary. This often involves extensive work with gene sequencing and analysis.
These operations can result in a huge amount of raw data that needs to be organized and analyzed. A variety of software solutions—on the market and in open-source formats—are available to aid researchers in analyzing genetic data sets. However, a unique software solution offered by Denmark-based CLC bio is proving to be the best software solution for a variety of biorefining companies, including Sapphire Energy Inc., BioGasol ApS, and Synthetic Genomics Inc.
One common way CLC bio’s bioinformatics solutions is used by industry, says Cecilie Boysen, the company’s senior consulting manager of the Americas, is that scientists identify a gene they want to optimize to make a particular organism perform better under industrial conditions. A company working with algae might want to optimize its ability to produce oil or grow under very specific conditions. Similarly, a company developing a cellulosic fermentation production technology may need to optimize its yeast or bacteria to produce under certain temperature variables.
The technologies and processes to perform this type of selection activity generally result in an overwhelming amount of data. Many companies have traditionally spent days or weeks analyzing it in a semi-automated fashion using several stand-alone software packages. In some cases, fully manual work is also completed using Excel spreadsheets. According to Boysen, the bioinformatics solution offered by CLC bio essentially serves as a pipeline that automates all of these activities. The result is an analysis process that is significantly less time-intensive, and less prone to human error.
CLC bio has been offering bioinformatics solutions since its founding in January 2005. According to Lasse Görlitz, the company’s director of communications, CLC bio was established by two brothers in Denmark. “The original idea was to make some powerful analysis tools and put them into the hands of scientists who can interpret the resulting data,” he says. In the past, molecular biologists and other researchers had to collaborate with programmers to analyze genetic data. This is because the algorithms used were not in the form of a graphical interface. Rather, they needed to be scripted and programmed, not only in the beginning, but throughout the analysis process assuming that the algorithms had to be tweaked to better perform. “You had to be able to script and program to actually run these analyses,” Görlitz said. “We wanted to be able to put the tools in the hands of those who actually had the knowledge to interpret the data.”
In the first three years of the company’s existence, approximately 1 million users downloaded the program. Considering the relatively small size of the scientific community needing this type of analysis tool, Görlitz calls the reach of the program outstanding and formerly unheard of in the sector. When next-generation gene sequencing became available in 2007, CLC bio decided to also serve that need in the scientific community. Next-gen sequencing is vastly less expensive than traditional methods and results in even greater amounts of data. CLC bio’s decision to service that need has paid off, says Görlitz, as the company’s software sales grew more than 30 percent last year, with profits growing by more than 50 percent.
Traditionally, the bioinformatics solution has been widely used by those in several sectors, including plant research, but interest from the biorefining community is relatively new. Görlitz stresses that his company is confident its presence in the sector will continue to grow dramatically.
Excerpt: In December, CLC bio announced that Sapphire Energy installed its bioinformatics solution. “At the core of our science operations and biology operations here, we institute a high throughput of biology capabilities, meaning we look at thousands and thousands of different traits simultaneously,” says Tim Zenk, Sapphire Energy’s vice president of corporate affairs. “Our objective in working with CLC bio was to bring the biotechnology informatics capability in-house so we can categorize and speed up the process of identifying traits and genes that we are looking for.”
According to Zenk, the solution implemented at Sapphire Energy required minimal customization and allowed Sapphire Energy employees to immediately go to work on a relatively standard platform they were already used to. While some customization was required to accommodate the specific genes his company is looking for, Zenk says implementation of the solution required very little work on the side of Sapphire Energy. “We implemented our internal intrascience intranet that enables our scientists to access it through the enterprise network,” he says. “CLC bio customized the software to meet our particular high-throughput biology needs, focused in on the algae genomes that we are looking for.”