New features#

The following are changes cover everything from antiSMASH 6.0 to antiSMASH 7.0.

Major#

Adenylation domain predictions

Adenylation domain predictions for NRPS clusters have been updated with a much larger set of known domains from MIBiG 3.1, using signature based methods. NRPSPredictor2 SVM/machine learning models are still in use, but used by the newer prediction tool instead of NRPSPredictor itself.

CompaRiPPson

This module compares ribosomally synthesised and post-translationally modified peptide (RiPP) precursors to known precursors from the MIBiG 3.1 database, along with predicted precursors from the antiSMASH database.

For more detailed information, see CompaRiPPson.

Gene table

Adds a filterable and interactable table to the HTML output.

Genes can be filtered by entering a search term in the search box (plain text and regex are both supported). Genes that match the filter will be shown in the region view and, if enabled, the view will automatically zoom to the selection. Information used for filtering currently includes the name of the gene, its biosynthetic type, and gene function annotations (e.g. SMCoG hits).

Genes can also be interacted with in the table in the same way as in the region view.

TFBS finder

A module designed to find transcription factor binding sites, using the LogoMotif profiles. The module can be enabled on the command line with --tfbs.

For more detailed information, see the TFBS page.

Minor#

Detection rules

  • Added: 2-deoxy-streptamine aminoglycosides, aminopolycarboxylic acid metallophores, crocagins, methanobactins, mycosporines, NRP-metallophores, opine-like metallophores, phosphonate-likes, fungal-RiPP-likes, RCDPS
  • Updated: phosphonate

For more detailed information, see the glossary.

Analysis:

  • New NRPS/PKS domains, including carnitine-AT (cAT), product template (PT), and thiocysteine/beta-eliminating lyase (SH) (see the full list for details)
  • Improved performance and accuracy of unannotated precursor detection in RiPP modules
  • Updated ClusterCompare database to MIBiG 3.1
  • Clusterblast known database updated to MIBiG 3.1
  • Added transATor predictions for Trans-AT PKS domains

Output:

  • Protoclusters with a cutoff larger than neighbourhood are now marked as being on a contig edge when relevant
  • NRPS/PKS monomer predictions are no longer hidden when no polymer predictions are made

Visualisation:

  • TIGRfam domains within CDS features are now shown in a similar fashion to the existing Pfam and NRPS/PKS domains.
  • An additional NRPS/PKS module visualisation focused on domain layout and predicted module ordering (see here)
  • Updated KS and Condensation domain NaPDoS links to use NaPDoS v2

Other:

  • Detection rules can now use dynamic profiles, defined with python code instead of pHMMs
  • MUSCLE has been removed as a binary dependency

Fixes and small changes#

Analysis:

  • Improved detection of cross-CDS NRPS/PKS modules
  • Improved non-colinear order finding of pure PKS clusters
  • Deconflicted newly found RiPP precursors when multiple types overlapped

Input handling:

  • Improved handling of splice variants
  • Long contig/record names are no longer truncated by default (previous behaviour can be used with --no-allow-long-headers)
  • Handled some issues when running gene finding over inputs with very large gaps
  • Improved handling of various kinds of corrupted/invalid feature annotations in Genbank inputs

Output:

  • Updated all Pfam links to the new EBI website

Visualisation:

  • Unified all RiPP precursor sequence visualisations

Other:

  • Improved various parts of the database download and preparation processes
  • Improved file size optimisation for HTML and javascript, for better loading times on slower connections
  • Replaced all FTP database download URLs with HTTPS, to better support restricted install environments

Numerous other small changes and fixes were made internally, for a full list see the git shortlog.