Module Detail Information

Type: Module
Short URL:
Description:EMBOSS *chips* calculates Frank Wright's Nc statistic for the effective number of codons used. This is a simple measure that quantifies how far the codon usage of a gene departs from equal usage of synonymous codons. This measure of synonymous codon usage bias, the 'effective number of codons used in a gene', Nc, can be easily calculated from codon usage data alone, and is independent of gene length and amino acid (aa) composition. Nc can take values from 20, in the case of extreme bias where one codon is exclusively used for each aa, to 61 when the use of alternative synonymous codons is equally likely. Nc thus provides an intuitively meaningful measure of the extent of codon preference in a gene. The Nc statistic has problems in very short sequences (20 amino acids or less) which are yet to be fully resolved. They are caused by the need to consider amino acids which are missing in the sequence. This calculation was originally in the EGCG package as "codfish" (codon usage for fission yeast). As Frank Wright is a vegan, we looked for a meat-free name for the EMBOSS version, "chips". The official explanation is "Codon Heterozygosity (Inverse of) in a Protein-coding Sequence" If the sequence extends beyond the coding region then the start and/or end positions of the CDS must be provided because chips analyses exclusively protein coding regions. USAGE: Standard (Mandatory) qualifiers: [-seqall] seqall Nucleotide sequence(s) filename and optional format, or reference (input USA) [-outfile] outfile [*.chips] Output file name Additional (Optional) qualifiers: (none) Advanced (Unprompted) qualifiers: -[no]sum boolean [Y] Sum codons over all sequences Associated qualifiers: "-seqall" associated qualifiers -sbegin1 integer Start of each sequence to be used -send1 integer End of each sequence to be used -sreverse1 boolean Reverse (if DNA) -sask1 boolean Ask for begin/end/reverse -snucleotide1 boolean Sequence is nucleotide -sprotein1 boolean Sequence is protein -slower1 boolean Make lower case -supper1 boolean Make upper case -sformat1 string Input sequence format -sdbname1 string Database name -sid1 string Entryname -ufo1 string UFO features -fformat1 string Features format -fopenfile1 string Features file name "-outfile" associated qualifiers -odirectory2 string Output directory General qualifiers: -auto boolean Turn off prompts -stdout boolean Write standard output -filter boolean Read standard input, write standard output -options boolean Prompt for standard and additional values -debug boolean Write debug output to program.dbg -verbose boolean Report some/full command line options -help boolean Report command line options. More information on associated and general qualifiers can be found with -help -verbose -warning boolean Report warnings -error boolean Report errors -fatal boolean Report fatal errors -die boolean Report dying program messages
Input Parameters:
 - Sequences
 - Start
 - End
 - Reverse
 - Ask
 - Nucleotide
 - Protein
 - Lower Case
 - Upper Case
 - Format
 - Database Name
 - Entry Name
 - UFO Features
 - Features Format
 - Features File
 - Turn Off Prompts
 - Standard Output
 - Filter
 - Options
 - Debug
 - Verbose
 - Help
 - Report Warnings
 - Report Errors
 - Report Fatal Errors
 - Report Dying Program Messages
 - Version
 - No Sum
Output Parameters:
 - Output Directory
 - Output File
File size:26.96 KB
View Source    Download    Open