Events News Research CBS CBS Publications Bioinformatics
Staff Contact About Internal CBS CBS Other

RevTrans - Download

Command line RevTrans

RevTrans is written in Python 2.2 and should work on any system (MacOS X, Unix, Windows etc.) with Python 2.2 or newer installed. Please notice that you might need to modify the very first line of the program to point to the location of you python installation.

The program is invoked from the shell (command line), and has a build in help page ( -h).

Download RevTrans v.1.3: revtrans-1.3.tgz
Browser all releases.

RevTrans help page

	revtrans - performs a reverse translation of a peptide alignment

	revtrans dnafile pepfile [-v] [-h] [-gapin chars] [-gapout char] 
	 [-Idna format] [-Ipep format] [-mtx tablename/file] [-match method] 
	 [-O format] [outfile] 

       Reads a set of aligned peptide sequences from pepfile and uses
       the corresponding DNA sequences from dnafile to construct a
       reverse translated version of the alignment.
       By default the input file formats are auto detected and the 
       corresponding DNA and peptide sequences is found by translation.

       In the typical case this means that the user only need to 
       supply the DNA and peptide sequences, and may safely ignore
       the more advanced options. E.g :
	       revtrans kinases.dna.fsa kinases.prot.aln 
       The final alignment is written to STDOUT or outfile if specified,
       and is by default in FASTA format.

		Help. Print this help information.
	-gapin chars
		Specify gap characters in the input sequences.
		Default is '.-~'
	-gapout	char
		Specify which character should be used for gaps in the
		Default is '-'
	-Idna format
		Specify format of the input DNA file.
		Valid formats are: auto (default), fasta, msf and aln

	-Ipep format
		Specify format of the input peptide file.
		Valid formats are: auto (default), fasta, msf and aln

	-O format
		Specify format of the output file.
		Valid formats are: fasta (default), msf and aln
	-mtx tablename/file
		Use alternative translation matrix instead of the build-in
		Standard Genetic Code for translation.
		If "tablename" is 1-6,9-16 or 21-23 one of the alternative 
		translation tables defined by the NCBI taxonomy group will be 
		Briefly, the following tables are defined:
		 1: The Standard Code 
		 2: The Vertebrate Mitochondrial Code 
		 3: The Yeast Mitochondrial Code 
		 4: The Mold, Protozoan, and Coelenterate Mitochondrial Code 
		    and the Mycoplasma/Spiroplasma Code 
		 5: The Invertebrate Mitochondrial Code 
		 6: The Ciliate, Dasycladacean and Hexamita Nuclear Code 
		 9: The Echinoderm and Flatworm Mitochondrial Code 
		10: The Euplotid Nuclear Code 
		11: The Bacterial and Plant Plastid Code 
		12: The Alternative Yeast Nuclear Code 
		13: The Ascidian Mitochondrial Code 
		14: The Alternative Flatworm Mitochondrial Code 
		15: Blepharisma Nuclear Code 
		16: Chlorophycean Mitochondrial Code 
		21: Trematode Mitochondrial Code 
		22: Scenedesmus obliquus mitochondrial Code 
		23: Thraustochytrium Mitochondrial Code 
		See [Genetic Codes]
		for a detailed description.
		If a filename is supplied the translation table is read from
		file instead. 
		The file should contain one line per codon in the format:
		codonaa-single letter code
		All 64 codons must be included. Stop codons is specified 
		by "*". T and U is interchangeable. Blank lines and lines
		starting with "#" are ignored.
		See the "gcMitVertebrate.mtx" file in the RevTrans source
		distribution for a well documented example.
	-match method
		Specify how to match the corresponding DNA and peptide 
		sequences. Valid methods are: trans (default), name and pos.
		Please note that both DNA and peptide sequence should have 
		unique names, regardless of the matching method. 
			Match sequences by translation. The DNA sequences are
			translated using the standard genetic code (default)
			or an alternative translation matrix if the -mtx
			option is used.
			Match sequences by name. Please note that for FASTA 
			files everything after the ">" is considered the
			sequence name. 
			Match by position. The sequence are matched by position
			in the files (first DNA sequence with first peptide
			sequence etc.).			
		Verbose. Print extra information about files, sequences
		and the progress in general to STDERR.
		The verbose level can be set to three degrees of
		-v:   verbose level 1
			Info about files, number of sequences read etc.
			Use this as the first try if something needs
		-vv:  verbose level 2
			As level 1 + 
			Print detailed info about all the sequence names.
		-vvv: verbose level 3
			As level 2 +
			Do a sanity check on the degapped length of the
			sequences. Warn if the sizes do not match.
	Rasmus Wernersson,
	September 2002, February 2003, July 2004