The AFproject interface shows results only for the best-performing set of input parameters of a given alignment-free tool.
Full information about the benchmarking results, including all combinations of parameter values of the evaluated tools, can be downloaded from
a0517486dfc53b2a28ac28b42eb9bb4d
The package contains 10,181 tool runs across 12 benchmark results, covering 1,020,463,773 pairwise sequence comparisons.
Path | Description |
---|---|
[dataset_category]/ | Directory: benchmark category name (e.g., genreg for Regulatory Sequences) |
[dataset_name]/ | Directory: data set name (e.g., crm, swisstree) |
[tool_name]/ | Directory: tool name (e.g., FFP, mash, alfpy--canberra) |
[tool_run]/ | Directory: tool run name (e.g., run1, kmer4) |
log.txt | Log file containing the parameter values used in the tool run (e.g., k = 4) |
[tool_output].[tsv/phy/newick] | Raw output file obtained from the tool run |
[tool_output].[tsv/phy/newick].clean | Cleaned output file (e.g., with unnecessary pairwise comparisons removed) |
benchmark.json | JSON file containing the benchmark results of the tool run (e.g., Robinson-Foulds distance, AUC values) |
summary.json | JSON file summarizing the benchmark results from all runs of a given tool. The results in the file are ordered by performance. The first result in the file (best performance) was selected and saved in the AFproject database. |
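Once the package is unpacked, the per-run results can be collected with a short script. The following is a minimal sketch, not part of AFproject itself: it assumes the directories in the table are nested in the order listed (category, data set, tool, tool run) and that each tool run directory holds a `benchmark.json` file. The function name `iter_tool_runs` is hypothetical, and the internal structure of the JSON files is not assumed; each file is simply parsed and returned as-is.

```python
import json
from pathlib import Path


def iter_tool_runs(root):
    """Yield (category, dataset, tool, run, benchmark) for every tool run.

    Assumption: files are laid out as
    <root>/<dataset_category>/<dataset_name>/<tool_name>/<tool_run>/benchmark.json
    """
    root = Path(root)
    # Match exactly four directory levels below the root.
    for path in sorted(root.glob("*/*/*/*/benchmark.json")):
        category, dataset, tool, run = path.relative_to(root).parts[:4]
        with path.open(encoding="utf-8") as handle:
            # The JSON content is returned unchanged; its keys are not assumed.
            yield category, dataset, tool, run, json.load(handle)
```

Called on the directory where the package was extracted, `iter_tool_runs` yields one tuple per tool run, which can then be filtered by category or tool name before inspecting the benchmark values.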
The full information about AFproject's reference data sets is available for download:
d6d00cbff056c76a992dc7cb421c6fbe
The package includes FASTA sequences, translations of sequence identifiers, trusted phylogenetic trees, as well as protein and regulatory sequence classifications.