User:Markds/BioSLAX

BioSLAX is a Live CD/Live DVD/Live USB comprising a suite of more than 30 bioinformatics tools and application suites. It has been released by the Bioinformatics Resource Unit of the Life Sciences Institute (LSI), National University of Singapore (NUS) and is bootable from any PC that allows has a CD/DVD or USB boot option and runs the compressed Slackware flavour of the LINUX Operating System (OS), also known as SLAX. SLAX was created by Tomáš Matějíček in the Czech republic using the Linux Live Scripts which he also developed. The BioSLAX derivative was created by Mark De Silva, Lim Kuan Siong and Tan Tin Wee.

BioSLAX was first released to the NUS Life Science Curriculum in April 2006.

History
In January 2003, APBioNet received a research grant from the Pan Asia Networking (PAN) Programme of IDRC (Canada) to build an APBioBox of commonly used bioinformatics applications and packages with grid-computing software as part of its effort to build an APBioGrid. The platform chosen was the then ubiquitous Redhat Linux. In March of that same year, APBioNet launched an industry partnership scheme (AIPS) and partnered with Sun Microsystems to build BioBox for the Solaris platform. Six months later, beta versions of APBioBox and Sun's biobox, now called Bio-Cluster Grid were released for beta testing among selected parties. The packages included Globus Grid Toolkit Version 2.0 and Sun Grid Engine respectively.

On 4th December 2003, the biobox software packages now named APBioBox (Redhat Linux) and BioCluster Grid (Sun Solaris) were field-tested at a Bioinformatics Workshop was conducted at the Advanced Science and Technology Institute (ASTI), Department of Science and Technology (DOST), Philippines on the occasion of the 70th Anniversary of the National Research Council of the Philippines (NRCP). Ten pentium machines and a couple of Sun servers were successfully inducted into the APBioGrid. This Workshop and the software tested were sponsored by Sun Microsystems and partially funded by IDRC.

In July 2004, Dr Derek Kiong introduced Knoppix as a stable, powerful and small-footprint Unix (Debian-based) platform to A/Prof Tan Tin Wee in a workshop organised by the Institute of Systems Science (ISS), NUS. By September 2004, through Mr Ong Guan Sin, we were able to create a Knoppix remaster template by building software in APBioBox plus usefuly applications into a prototype, APBioKnoppix, as a project for the practical course of LSM2104 module of the Dept of Biochemistry, NUS. It was subsequently upgraded based on Knoppix 4.02 and released as APBioKnoppix2. While APBioKnoppix was widely used, it was found that it was not easily expandable. All applications had to be inplace prior to remastering and this made the distribution highly inflexible.

In June of 2005, Mr. Mark De Silva, suggested using SLAX as a base for a new bio based live cd. Because SLAX worked by overlaying "application modules" on top of the base Linux OS, it made the entire distribution modular. The additional functionality of deploying these modules even while the system was already running, made using SLAX even more appealing.

By April 2006, the first version of BioSLAX was released with several editions:


 * Standard User Edition (530 MBytes)
 * Developer Edition (700 MBytes)
 * Sever Edition (470 MByte)

BioSLAX was subsequently used in the bioinformatics teaching module within NUS under the Life Science Curriculum as well as in several events that were organized under the umbrella of the Asia Pacific Bioinformatics Network (APBioNet). APBioNet is a regional affiliate of the International Society for Computational Biology (ISCB). Customized versions were built to cater for both NUS and APBioNet.

In August 2007, in collaboration with the APBioNet, a customized BioSLAX was used to set up the Bioinformatics Resource Node of Vietnam at Bio-IBT, the Bioinformatics Resource Server of the Institute of Biotechnology, Vietnam Academy of Science and Technology, Hanoi, Viet Nam. The Bio-IBT node offered :


 * BioMirrors repository of biological databases
 * NCBI BLAST mirrored resource
 * Web access to EBI EMBOSS applications
 * Web access to CLUSTALW multiple sequence alignment
 * Web access to the T-Coffee multiple sequence alignment
 * Web access to the PHYLIP Phylogenetic Inference Package
 * Web access to the Sequence Manipulation Suite, SMS2

Users with SSH access to the server also had access to many more command line based bio/life science applications.

The entire project was done in collaboration with the 1st UNESCO-IUBMB-FAOBMB-APBioNet Bioinformatics Workshop in Vietnam, held 20th to 31st of August 2007, a satellite event of the 6th International Conference on Bioinformatics (InCoB) 2007 at HongKong, Hanoi and Nansha.

Some versions of BioSLAX deployed in international instituitions under APBioNet were fitted with a small tool which allowed them to map their IPs to a dynamically created apbionet.org domain name, hence giving each machine a fully qualified domain name (FQDN) and presense on the Internet.

Versions
To date, there have been two versions of BioSLAX - BioSLAX 5.x based on SLAX 5 and BioSLAX 7.x based on SLAX 6. While BioSLAX 5.x followed the version numbers of SLAX 5, BioSLAX 7 adopted a new version numbering which is one higher than the SLAX version it is based on. Latest versions can be downloaded from the BioSLAX website.

BioSLAX 5.x
BioSLAX 5.x was largely based on the 5.1.8 version of SLAX, running earlier versions of the 2.6 Linux kernel and KDE 3.4, with unionfs.

Standard User Edition
This edition runs the KDE X-Windows GUI and comes with all the tools and application suites, but does not include any compiler tools nor the Linux kernel source code and headers. This is mainly suited for users who only need to use the tools and applications suites. It has a very small size, making it easy to download and particularly convenient for regions where internet bandwidth is an issue.

Developer Edition
This edition runs the KDE X-Windows GUI and comes with all the tools and application suites and also includes a full set of development and compiler tools and also including the Linux kernel source code and headers. This is edition is more for the power user, who, in addition to using the various tools and applications, might want to also compile new applications or create new application modules for BioSLAX.

Sever Edition
This edition does not include any X-Windows GUI, compilation tools, Linux kernel source or kernel headers. It is primarily meant to be used as a remote server, where users have to either SSH in to use the command line applications or connect to the server via the web to access the available web-based portals to popular bio applications.

NUS LSM Edition
This edition is the Developer Edition, customized for use by the NUS Life Science Curriculum for the teaching of bioinformatics.

Taverna Edition
This edition is the Developer Edition which includes TaveRNA. The TaveRNA Project aims to provide a language and software tools to facilitate easy use of workflow and distributed compute technology.

BioSLAX 7.x
BioSLAX 7.x is based on SLAX 6 and features the later releases of the 2.6 Linux kernel, KDE 3.5 and using aufs and lzma compression. The biggest change is the use of this version as either client or server. The distribution was also moved from CD to DVthe teaching of bioinformatics.

Taverna Edition
This edition is the Developer Edition which includes TaveRNA. The TaveRNA Project aims to provide a language and software tools to facilitate easy use of workflow and distributed compute technology.

BioSLAX 7.x
BioSLAX 7.x is based on SLAX 6 and features the later releases of the 2.6 Linux kernel, KDE 3.5 and using aufs and lzma compression. The biggest change is the use of this version as either client or server. The distribution was also moved from CD to DVD, allowing for more applications to be introduced, which were previously left out ofedition is the Developer Edition which includes TaveRNA. The TaveRNA Project aims to provide a language and software tools to facilitate easy use of workflow and distributed compute technology.

BioSLAX 7.x
BioSLAX 7.x is based on SLAX 6 and features the later releases of the 2.6 Linux kernel, KDE 3.5 and using aufs and lzma compression. The biggest change is the use of this version as either client or server. The distribution was also moved from CD to DVD, allowing for more applications to be introduced, which were previously left out of version 5.x due to space considerations. The ability to boot from a FAT formatted USB drive was also introduced in SLAX 6, hence BioSLAX 7.x versions also had this feature.

Standard Tools
BioSLAX features the Linux Slackware 12.1 operating syste6 and features the later releases of the 2.6 Linux kernel, KDE 3.5 and using aufs and lzma compression. The biggest change is the use of this version as either client or server. The distribution was also moved from CD to DVD, allowing for more applications to be introduced, which were previously left out of version 5.x due to space considerations. The ability to boot from a FAT formatted USB drive was also introduced in SLAX 6, hence BioSLAX 7.x versions also had this feature.

Standard Tools
BioSLAX features the Linux Slackware 12.1 operating system and standard useful applications such as:


 * PERL
 * PHP
 * Apache 2
 * MySQL
 * Openoffice
 * KPDF Reader
 * Mozilla Firefox
 * Mozilla Thunderbird
 * gFTP
 * ProFTPd
 * Open SSH
 * Kopete Instant Messenger
 * VNC Viewer
 * Remote Desktop Services

BioInformatics Tools
For bioinformatics tools and applications, they are subdivided into three main categories.

Console Apps

 * Blast
 * BlastCL3
 * BioGrep
 * ClustalW
 * EMBOSS
 * Genesplicer
 * GlimmerHMM
 * HMMER
 * Modeller
 * PamL
 * Phylip
 * Primer3
 * R programming language & Bioconductor
 * T-Coffee

Desktop Apps

 * ACT
 * Artemis
 * ClustalX (GUI Based ClustalW)
 * JAligner
 * Jalview
 * jEMBOSS (Java EMBOSS Suite)
 * Jmol
 * NJPlot
 * Pymol
 * ReadSEQ
 * TreeView
 * Weka (machine learning)

Web Apps

 * Web Blast
 * Web ClustalW
 * Web Phylip
 * Web T-Coffee
 * wEMBOSS (Web based EMBOSS suite)
 * Sequence Manipulation Suite (SMS)

BioSLAX Updates
BioSLAX will be updated as newer Slackware (or SLAX) versions are released. The tools and applications suites will also be monitored for significant changes and upgraded as necessary. Some tools may be removed to make way for other tools which can do the same thing but with added functionality and better efficiency. More web based portals are being looked at, for example, portals to ReadSeq, Primer3 and Genesplicer are in the pipeline.

Grid Deployment
The developers are currently looking at integrating various Grid computing platforms with BioSLAX. Because BioSLAX can be booted up immediately from any CD/DVD/USB, it can be used as a rapidly deployable Grid-enabled Operating System.

Using the Univa UD Grid mpagent, early testing shows that the agent, once modularized on BioSLAX, can be used to Grid enable machines from any location as clients to a master network, effectively creating an "global-wide grid". The developers are now looking at how BioSLAX can integrate the EGEE Grid platform for rapid scaleable expansion of grid nodes for bioinformatics and high throughput computational analysis of lipidomics data.