Cluster Flow
Function | A pipelining tool to automate and standardise bioinformatics analyses on cluster environments. |
---|---|
Language | Perl |
Requirements | Optional GridEngine, LSF or Slurm cluster |
Code Maturity | Beta - Being used in production, but still under active development. |
Code Released | Yes, under GPL v3 or later. |
Initial Contact | Phil Ewels |
Go to Cluster Flow websiteDownload Now |
Usage / Installation Tutorial
How to configure and run Cluster Flow
Cluster Flow is a command-line program which uses GRIDEngine or LSF cluster environments to run analysis pipelines.
Benefits of using Cluster Flow:
- Routine analyses are very quick to run, for example:
cf --genome GRCh37 fastq_bowtie *fq.gz
- Pipelines use identical parameters, standardising analysis and making results more reproducable
- Integrated parallelisation tools help prevent your cluster becoming overloaded
- All commands and output is logged in files for future reference
- Intuitive commands and comprehensive documentation make Cluster Flow easy to use
- Requires minimal setup time
How Cluster Flow differs from other pipeline tools:
- Very lightweight and flexible
- Pipelines and configurations can easily be generated on a project-specific basis if required
- New modules and pipelines are very easy to write
Documentation
Cluster Flow has a website with extensive documentation: http://clusterflow.io