meta data for this page
Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
general:softwares:fcat [2021/10/12 10:43] – created vinh | general:softwares:fcat [2022/11/16 14:22] (current) – [fCAT score modes] vinh | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | blablabla | + | ====== fCAT ====== |
+ | |||
+ | One of the critical steps in a genome sequencing project is to assess the completeness of the predicted gene set. The standard workflow starts with the identification of a set of core genes for the taxonomic group, in which the target species belongs to. The fraction of missing core genes serves then as a proxy of the target gene set completeness. | ||
+ | |||
+ | [[https:// | ||
+ | |||
+ | fCAT outputs both the summary result in a tabular text file and the phylogenetic profile of the core genes, which can be visualized using the tool [[https:// | ||
+ | |||
+ | {{: | ||
+ | ====== Table of Contents ====== | ||
+ | |||
+ | * [[# | ||
+ | * [[# | ||
+ | * [[# | ||
+ | * [[# | ||
+ | * [[# | ||
+ | |||
+ | ====== How to install ====== | ||
+ | |||
+ | //fCAT// tool is distributed as a python package called //fcat//. It is compatible with [[https:// | ||
+ | |||
+ | You can install //fcat// using '' | ||
+ | |||
+ | '' | ||
+ | |||
+ | or, in case you do not have admin rights, and don't use package systems like [[https:// | ||
+ | |||
+ | '' | ||
+ | |||
+ | and then add the following line to the end of your **~/ | ||
+ | |||
+ | '' | ||
+ | |||
+ | ====== Usage ====== | ||
+ | |||
+ | The complete process of //fCAT// can be done using one function '' | ||
+ | |||
+ | '' | ||
+ | |||
+ | where **eukaryota** is name of the fCAT core set (equivalent to [[https:// | ||
+ | |||
+ | ====== Output ====== | ||
+ | |||
+ | You will find the output in the /// | ||
+ | |||
+ | * // | ||
+ | * // | ||
+ | * // | ||
+ | * // | ||
+ | * // | ||
+ | |||
+ | Besides, if you have already run //fCAT// for several query taxa with the same fCAT core set, you can find the merged phylogentic profiles for all of those taxa within the corresponding core set output (e.g. /// | ||
+ | |||
+ | To learn how to interpret the phylogenetic profiles using PhyloProfile, | ||
+ | |||
+ | ====== fCAT score modes ====== | ||
+ | |||
+ | The table below explains how the //specific ortholog group cutoffs// for each fCAT core set were calculated, and which //value of the query ortholog// is used to assess its completeness, | ||
+ | |||
+ | ^ Score mode ^ Cutoff ^ Value used for comparing ^ | ||
+ | | Mode 1 - Strict mode | Mean of FAS scores between all core orthologs | Mean of FAS scores between query ortholog and all core proteins | | ||
+ | | Mode 2 - Reference mode | Mean of FAS scores between refspec and all other core orthologs | Mean of FAS scores between query ortholog and refspec protein | | ||
+ | | Mode 3 - Relaxed mode | The lower bound of the confidence interval calculated by the distribution of all-vs-all FAS score in a core group | Mean of FAS scores between query ortholog and all core proteins | | ||
+ | | Mode 4 - Length mode | Mean and standard deviation of all core protein lengths | Length of query ortholog | | ||
+ | |||
+ | {{: | ||
+ | |||
+ | //Note: **FAS scores** are bidirectional FAS scors; **core protein** or **core ortholog** is protein in the core ortholog groups; **query protein** or **query ortholog** is ortholog protein of query species; **refspec** is the specified reference species// | ||
+ | |||
+ | ====== Contact ====== | ||
+ | |||
+ | For further support or bug reports please contact: tran@bio.uni-frankfurt.de |