The Recombination Analysis Tool (RAT) is a cross-platform, Java-based software application designed for the high-throughput detection of recombination events in multiple sequence alignments. Developed by Graham J. Etherington, Jo Dicks, and Ian N. Roberts in 2005, it allows researchers to find genetic crossovers across varying sequence lengths, ranging from short viral fragments to entire viral genomes. Core Mechanism and Algorithm
Distance-Based Method: RAT relies entirely on a distance-based method, utilizing pairwise sequence comparisons rather than complex phylogenetic tree reconstruction.
Sliding Window Approach: The tool operates by moving a “sliding window” of a user-defined length along a multiple sequence alignment at specific increments.
Distance Scoring: Inside each window, every nucleotide or amino acid of a designated test sequence is cross-compared to the corresponding position in all other sequences to calculate an absolute distance score.
Optimization Rule: While parameters can be adjusted manually, the general rule of thumb for optimization is to set the window size to 10% of the total sequence length, with an increment size equaling half of the window size. Key Features
Format Flexibility: RAT can evaluate both DNA (nucleotide) and protein (amino acid) alignments. It accommodates seven distinct alignment file formats, though FASTA format is generally preferred.
Search Modes: Users can manually inspect individual lines via the Single-sequence viewer, or run automated batch analysis through the built-in Auto Search option.
Graphical Output: The software maps out crossover points visually, charting genetic distance on the y-axis against the linear position of the sequence on the x-axis.
Export Options: Numerical data can be downloaded into text files, spreadsheets (.xls, .csv), and graphics can be exported directly as .jpg images. Practical Applications
RAT has been heavily used in the field of virology, where rapid mutation and genetic recombination drive evolution. Historically, its utility was demonstrated through the identification of known recombinant strains of HIV-1 and the localized mapping of recombination “hot spots” within Noroviruses (specifically at the junction between ORF1 and ORF2).
For those looking to explore the software or review the source code, you can find the repository directly on the RAT GitHub Page. Are you looking to analyze a specific dataset using RAT, or ethering/RAT – GitHub
Leave a Reply