High performance computing helps saves the NZ stick insect

With the increasing use of next generation DNA sequencing technologies as a regular part of biological research, vast amounts of DNA sequence data are being generated at an ever increasing rate. But with this added capability and power comes technical challenges around data management, processing and analysis. Genomics work carried out in Landcare is no exception when it comes to encountering such problems. An example of these challenges is found is some recent work led by Landcare Research scientist Thomas Buckley and carried out by post-doctoral researcher Alice Dennis and PhD student Luke Dunning.

Alice and Luke are interested in the functional coding regions of genomes of stick insect species native to New Zealand. Until recently processing their insect DNA sequence data collected from the next generation sequencing-by-synthesis platforms took one whole week per individual; even when using a fast multi-core desktop Linux machine with plenty of RAM.

The length of time to process the sequence data significantly slows down their research program and limits the number of such processing steps that can be undertaken within any given project. To improve on this problem Dan White (Informatics team) worked with Alice and Luke to use the computing resources within the National e-Science Infrastructure (NeSI) to shift their memory absorbing processes to NeSI’s high performance computing (HPC) resources which Landcare Research has access to as a member of NeSI. In comparison to the resources available at Landcare Research, the NeSI HPC resources that are now available include an 80 processor computer with each processor having 12 cores and 96 GB of RAM and access to 200TB of disk storage. For the stick insect case above early tests have shown a reduction in processing time from one week to just over 3 hours; a dramatic and significant improvement! Further improvements are expected as we explore the options of processing multiple files simultaneously and by chaining multi step sequence analyses together in an automated process.

· High powered computing saves the NZ stick insect: With the establishment and increasing use of next generation DNA sequencing technologies as a regular part of biological research vast amounts of DNA sequence data are being generated at an ever increasing rate. But with this added capability and power comes technical challenges around data management, processing and analysis. Genomics work carried out in Landcare is no exception. An example of these challenges is found is some of the work carried out by postdoctoral researcher Alice Dennis, and PhD student Luke Dunning, who work with Thomas Buckley. Alice and Luke are interested in the functional coding regions of genomes of stick insect species native to New Zealand. But processing their insect DNA sequence data collected from the next generation sequencing-by-synthesis platforms took one whole week per individual; even when using a fast multi-core desktop Linux machine with plenty of RAM. This type of delay significantly slows down their research program and limits the number of such processing steps that can be undertaken within any given project. To improve on this problem Dan White (Informatics) worked with Alice and Luke, and the computing resources within the National e-S

High powered computing saves the NZ stick insect: With the establishment and increasing use of next generation DNA sequencing technologies as a regular part of biological research vast amounts of DNA sequence data are being generated at an ever increasing rate. But with this added capability and power comes technical challenges around data management, processing and analysis. Genomics work carried out in Landcare is no exception. An example of these challenges is found is some of the work carried out by postdoctoral researcher Alice Dennis, and PhD student Luke Dunning, who work with Thomas Buckley. Alice and Luke are interested in the functional coding regions of genomes of stick insect species native to New Zealand. But processing their insect DNA sequence data collected from the next generation sequencing-by-synthesis platforms took one whole week per individual; even when using a fast multi-core desktop Linux machine with plenty of RAM. This type of delay significantly slows down their research program and limits the number of such processing steps that can be undertaken within any given project. To improve on this problem Dan White (Informatics) worked with Alice and Luke, and the computing resources within the National e-Science Infrastructure (NeSI) to shift their memory absorbing processes to NeSI’s high powered computing (HPC) resources  which LCR has access to as a member of NeSI. In comparison to the resources available at LCR, the NeSI HPC resources that were now available include an 80 processor computer with each processor having 12 cores and 96 GB of RAM and access to 200TB of disk storage. For the stick insect case above early tests have shown a reduction in processing time from one week to just over 3 hours; a dramatic and significant improvement! Further improvements are expected as we exploring the options of running multiple files simultaneously and by chaining multi step sequence analyses together in an automated process.

cience Infrastructure (NeSI) to shift their memory absorbing processes to NeSI’s high powered computing (HPC) resources which LCR has access to as a member of NeSI. In comparison to the resources available at LCR, the NeSI HPC resources that were now available include an 80 processor computer with each processor having 12 cores and 96 GB of RAM and access to 200TB of disk storage. For the stick insect case above early tests have shown a reduction in processing time from one week to just over 3 hours; a dramatic and significant improvement! Further improvements are expected as we exploring the options of running multiple files simultaneously and by chaining multi step sequence analyses together in an automated process.

Tags: , , , ,
Posted in General | No Comments »

Kaipara Harbour – Update to website and online atlas

The Informatics development team have released a major upgrade to the Integrated kaipara Harbour Management group (IKHMG) website and on-line atlas – http://www.kaiparaharbour.net.nz/.

The IKHMG was established to promote integrated management and inter-agency coordination and kaitiakitanga of the Kaipara Harbour and its catchment for creating a healthy and productive Kaipara harbour. The redesigned website brings a new look and organisation to key information on the group’s core objectives; activities; and interests ,and provides new access to maps and the groups knowledge-base allowing information and data that describes the Kaipara Harbour to be explored.

This work was collaboration between the Landcare Research Informatics group and Te Uri o Hau through funding from the Te Uri o Hau Settlement Trust.

Tags: ,
Posted in General | No Comments »

New web mapping application tool launched – Our Environment

Following on from the successful launch in September of last year of the S-Map Online web mapping web site, the Informatics team recently unveiled a new and free online mapping tool which will make it easier for businesses, government, researchers and the public to better understand the natural environments that underpin New Zealand’s economy and society.

Our Environment Home Page

Our Environment (http://ourenvironment.scinfo.org.nz) is a new web mapping tool which provides easy access to information on New Zealand’s land environments and land resources. Based on “Google Maps” style navigation the mapping tool makes it easy to search, view and query environmental information. Using Our Environment users can

  • Use interactive maps to learn about their local environment and regional variations.
  • Select particular data themes such as vegetation, land suitability or surface rock type to view as maps.
  • View detailed information from a particular data theme.
  • Obtain further details about a particular theme (metadata).
  • Create high quality, custom PDF maps for printing.

Our Environment uses data from nationally significant databases such as the Land Resource Inventory System (LRIS), the Land Cover Database (LCDB) and other national datasets and is designed to support applications at local, regional and national scales.  Further data themes and features will be added as time goes by.

The Our Environment Mapping Application

The Our Environment mapping application is built using the new geospatial-portal-software platform for visualising geospatial data and for querying datasets. Designed and written by the Informatics team the platform is based on open standards and open source software (including PostGIS, MapServer, Geo Tools, MapCache (formerly mod-geocache) and Open Layers). Taking an across Landcare view of the future requirements for delivery of environmental science data through rich data visualisations, the platform has been designed so that it can be used, and extended if required, to roll out portal applications with a location-based focus. S-map Online was the first such portal, Our Environment is the second.  Other Landcare datasets have been already identified as being suitable for delivery through new instances of the web mapping application.

Tags: , , , , ,
Posted in General | No Comments »

New Zealand Virtual Herbarium launches

The New Zealand Virtual Herbarium (www.virtualherbarium.org.nz) has been launched by the New Zealand National Herbarium Network with Informatics having a significant input to the website and the underlying database.

The NZVH is a collaborative project involving 11 herbaria in New Zealand, developed under the auspices of the New Zealand National Herbarium Network, with financial assistance from Terrestrial and Freshwater Biodiversity Information System (TFBIS) Programme. It provides free internet access to over 700,000 records held by 11 botanical collections from Auckland to Dunedin. The records relate to over 1.4 million specimens of mosses, lichens, liverworts, ferns, algae and seed plants that have been collected in New Zealand from 1769 (Captain Cook´s first voyage ) to the present day, and cover all regions of the New Zealand mainland, as well as the offshore islands.

These herbaria records can now be searched simultaneously across the participating herbaria and results viewed as distribution maps or download. Each herbarium specimen is accompanied by information on its correct botanical name, where and when it was collected, by who, and also often ecological details.

While the larger collections have comprehensive records, until now users could not just go to their “local” herbarium to get all the information they needed. The New Zealand Virtual Herbarium pools all the available records in a ‘one stop shop’ making it easy for users to obtain the most complete picture for the species they are interested in, and generate distribution maps at the click of a button.  It also provides these data in a consistent format and standard.

The project to make these data accessible was implemented by the Informatics team using a platform that was initially developed for Australia’s Virtual Herbarium by the Council of Heads of Australasian Herbaria. Informatics undertook a major rebuild of the software and incorporated some new features to make it easier to use. Improvements were also made to the harvesting of records from each of the participating New Zealand  herbaria; NZVH consolidates all of these data into a single access point, providing a nation-wide view of botanical information.

Tags: , , , ,
Posted in General | No Comments »