BioNMR

BioNMR (http://www.bionmr.com/forum/)
-   NMR software (http://www.bionmr.com/forum/nmr-software-15/)
-   -   CONNJUR - open source integration environment for biomolecular NMR data analysis (http://www.bionmr.com/forum/nmr-software-15/connjur-open-source-integration-environment-biomolecular-nmr-data-analysis-8797/)

markber 11-01-2010 07:45 PM

CONNJUR - open source integration environment for biomolecular NMR data analysis
 
CONNJUR Project website


The CONNJUR Project is developing an open source integration environment for biomolecular NMR data analysis. CONNJUR software is a workflow generator, based on legacy NMR analysis applications, that is being developed as Open Source Software -- it is perpetually free for anyone to use, modify and distribute.

CONNJUR is developed by a community of NMR spectroscopists and scientific programmers who aim to create and maintain NMR analysis tools bounded by the goals of excellent end-product, free of charge, open source software using the highest standards. The workflow for modern biomolecular NMR spectroscopy consists of three phases: spectral reconstruction, the process of converting time domain data into the frequency domain; spectral analysis, which includes peak identification and resonance assignment; and biophysical characterization, which includes all subsequent data analysis in which the spectroscopic data is used to draw biophysical inferences (such as structure determination). Despite the simplicity of this overview, the actual processing workflow for biomolecular NMR is incredibly intricate and complex, requiring the use of dozens of software tools. Each of these software tools has its own data format mandating a simultaneous requirement for format converters (usually PERL scripts) which provide the interface between these tools. The remainder of the data management issues is left in the hands of the spectroscopist, who is solely in charge of ensuring data integrity between the various phases and sub-phases of data analysis.

http://connjur.uchc.edu/img/oldWorkflow.gif

NMR Data Analysis Workflow Without Integration: Data analysis proceeds through several steps using various NMR processing tools. Key data must be shuttled form one tool to another, requiring format conversion for most steps. Additional Information is retrieved from external databases. AT all processing stages, critical data is often stored in paper copy, reducing the efficiency of archival retrieval and increasing the error-rate in NMR analysis.



<hr color="#bc80bd"> http://connjur.uchc.edu/img/connjurInfinityLogoCONNJUR: An Open Source Software Solution

The complicated nature of NMR data management can be alleviated through the use of a common data store, preferably a relational database whose management system can guarantee relational integrity between the various pieces of data derived at different steps along the workflow. Effective use of a relational database is predicated on the development of a data model, the development of which is underway by at least three research groups<sup>1-3</sup>. However, once such a data model and database are developed, the unavoidable issue becomes how to use it. One solution is to redevelop all of the existing software tools such that they store and retrieve data in the database<sup>1</sup>. Another solution is to provide a workbench environment with interfaces which allow existing tools to integrate with the relational database backend. The latter solution is desirable in that it implicitly supports legacy applications removing the requirement that the user learn the operation of additional software tools. It also provides a static framework upon which dynamic tools can be developed, implemented, and optimized.

http://connjur.uchc.edu/img/newWorkflow.gif

NMR Data Analysis Using CONNJUR's Integration Environment: Data analysis still proceeds through several steps using the various NMR processing tools. However, in contrast with the above workflow, intermediate storage maximizes the efficiency of archival retrieval, decreases the error-rate, and leverages the database's built-in functionality for format conversion for data exchange.


This website describes such an integrated, workbench environment called CONNJUR as an open source initiative for the biomolecular NMR community. CONNJUR employs a standard three-tier architecture, composed of a relational database back-end, an application layer for wrapping existing NMR processing tools, and a front-end user interface. CONNJUR is coded in JAVA to support the contributions of a wide range of developers and capitalizes on other open source software for its development (e.g. using MySQL as the relational database). The purpose of CONNJUR is to provide a workbench environment from which most NMR data processing can be coordinated. Integrating NMR software tools with a common, relational database will ensure data integrity, provide user guidance and improve efficiency more than is currently the case using the various, available software tools independently.

http://connjur.uchc.edu/img/architecture2.gif

CONNJUR Architecture: Patterned off of the standard three-tier architecture, CONNJUR is built of three main layers: (1) an interface for user interaction, (2) a middle layer which both wraps the third-party software tools, their business logic and provides communication with (3) the third layer, a relational database management system. This separation of layers allows for alterations to one portion (for instance changing the database application) without extensive code revisions to the other layers. The boxes represent the concept of 'actors', which are modular computational units which transparently invoke third-party software to accomplish discrete conceptual tasks. The loose coupling between actors allows NMR spectroscopists to code and implement their own actors with little (or no) knowledge of the application code as a whole. This will, in turn, facilitate the rapid development of additional functionality for CONNJUR.



Enhanced Communication


When using GUI-driven software for spectral reconstruction, the only mechanism for sharing processing strategies is to describe the procedure stepwise, including all of the logic employed in the selection of the order of subprocesses and their configuration. This is equivalent to hands-on training. Script-based software allows a more sophisticated level of communication, as researchers can share their processing scripts<sup>7</sup>, in which the author's logic is hardcoded. Unfortunately, although the logic for any one script is built in, one requires extensive documentation to make adjustments to the script for different types of spectra. A script generator (http://sbtools.uchc.edu) can be used to further encapsulate the logic of spectral reconstruction, but suffers from the compromise between flexibility and usability. A script generator with too many options ceases to be useful.

CONNJUR provides an ideal environment for communication. Business logic for spectral reconstruction can be coded into the actors at any level of detail desired - such that the processing workflow is constructed independent from the spectral details, such as filename, number of points, order of dimensions, etc. As the processing workflows are modeled and stored inside the CONNJUR relational database, it will be an easy future development to provide for their import/export as XML.


Free and Open Source


CONNJUR will be distributed as Free and Open Source Software (FOSS). This licensing mechanism has many concrete benefits to the end user:

  • Price: FREE to download, install and use
  • Extensibility: No restrictions or royalties to modify or resell
  • Transparency: Source code readily inspected and verified
  • Active Development: FOSS encourages the scientific community to aid in the development of CONNJUR
  • Perpetual: As FOSS, CONNJUR is not dependent on the original developers. If we are unable to continue the project
    any member of the NMR community is free to continue developing and distributing the software

How can a software application be considered free if the supporting hardware and software are prohibitively expensive? In the spirit of Open Source, CONNJUR is designed for interoperability with other free and/or Open Source projects as demonstrated with the prototype:

  • Operating System: Linux
  • Framework Programming Language: Java
  • Database: MySQL
  • Integrated Development Environment: Eclipse


References



<li style="width: 75%;">Vranken, et al. (2004). The CCPN data model for NMR spectroscopy: Development of a software pipeline. Proteins: Structure, Function and Bioinformatics, 59, 687-696. <li style="width: 75%;">Fox-Erlich, et al. (2004) Delineation and analysis of the conceptual data model implied by the "IUPAC Recommendations for Biochemical Nomenclature". Protein Science, 13, 2559-2563. <li style="width: 75%;">Baran, et al. (2006) SPINS: A laboratory information management system for organizing and archiving intermediate and final results from NMR protein structure determinations. Proteins: Structure, Function, and Bioinformatics, 62, 843-851. <li style="width: 75%;">Ellis, et al. (2006) Development of an Integrated Framework for Protein Structure Determinations: A Logical Data Model for NMR Data Analysis. Published in Proceedings of the Third International Conference on Information Technology, Las Vegas, Nevada, USA <li style="width: 75%;">Delaglio, et al. (2005) NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 6, 277-293. <li style="width: 75%;">http://www.rowland.org/rnmrtk/toolkit.html <li style="width: 75%;">http://www.nanuc.ca/downloads/scripts.php


All times are GMT. The time now is 06:47 PM.

Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Search Engine Friendly URLs by vBSEO 3.6.0
Copyright, BioNMR.com, 2003-2013