Skip to content

Latest commit

 

History

History
326 lines (226 loc) · 27 KB

CHANGELOG.md

File metadata and controls

326 lines (226 loc) · 27 KB

Changelog

0.5.0 (2025-01-24)

Continuous Integration

  • pre-commit reformatting (4834ee0)

Miscellaneous Chores

Documentation

Features

  • detect AMR-determining SNPs in promoters if specified (582ea7a)
  • estimate cellular copy numbers by calculating coverage across the longest read for each AMR gene, then normalise by mean read depth across core genes (68566de)
  • genotype novel SNPs in promoters (ba8e654)

Bug Fixes

  • check for float (0eef5b2)
  • detect presence of causative SNPs without full allele matches] (7512592)
  • estimate mean depth only from core genes (49006e4)
  • fastq file path fix (52a6fbb)
  • fix adding promoter results to existing df (2a55d4f)
  • fix adding rpomoter results to existing df (8fa1c92)
  • only add promoters to results if non ref (6afa85b)
  • only add promoters to results if non ref (1c30eec)
  • remove depreciate append to df (b0f63e9)
  • remove underscores in read names and remove redundany graph outputs (c28172c)
  • tweak mpileup params (f23bfa2)
  • update graph_operations (921953b)
  • use min coverage of node in path to estimate copy number (d5c1291)
  • use samtools mpileup to estimate cellular copy number to include secondary alignments (440096e)

Styles

Tests

0.4.1 (2024-12-16)

Bug Fixes

  • use node-space path to estimate copy number in context (1f58801)

0.4.0 (2024-12-15)

Miscellaneous Chores

Features

  • use gene-space paths to assign reads (b30ede6)

Bug Fixes

  • add singleton paths (d7fe631)
  • apply new condition to gene-space subpath generation (2fa0755)
  • prevent checking of redundant path sublists (f79924e)
  • prevent combinatorial increase of gene-space sublists of paths. Multi-process with mp.pool (0afcba5)
  • prevent removal of small, high coverage components (870a07e)

Tests

  • update unittest (e211da2)
  • update unittests to reflect changes (f491c92)

0.3.1 (2024-11-18)

Build System

  • deps: bump tqdm from 4.64.1 to 4.66.3 (b4d9b4f)
  • deps: bump tqdm from 4.64.1 to 4.66.3 (ee877f1)

0.3.0 (2024-11-18)

Miscellaneous Chores

Documentation

  • add citation placeholder (0e1168e)

Features

  • use suffix trees to obtain all paths that span from the start of a block of AMR genes to the end (db6eb3c)

Bug Fixes

  • supplement full paths with singleton nodes (7f6e028)

Code Refactoring

  • import modularity of graph operations (a2ebde5)
  • improve modularity of read operations (425370a)
  • improve modularity of result operations (747276f)

Styles

  • pre-commit reformatting (580064a)

Tests

0.2.0 (2024-11-06)

Build System

Documentation

Features

Bug Fixes

  • add version to init and pyproject (eeecf4f)
  • bypass subsampling from dictionary (ac7604e)

Styles

  • pre-commit recommendations (6b808e1)

0.1.0 (2024-11-06)

Continuous Integration

Documentation

Features

  • add closest reference allele header to read clustering keys for evaluation (5967028)
  • add dynamic determination of node threshold (307744c)
  • add partial untittests for new functions (4eb534b)
  • add source direction to gml (63d3b38)
  • applies a fixed path threshold for correction then uses minimizer containment of paths to correct high coverage paths in the final round. (824bec0)
  • correct bubbles until a maximum of 10 iterations (7d4c87e)
  • dynamic determination of post-bubble popping node filtering thresholds (e84bc97)
  • dynamically choose a value for k (dfee11a)
  • filters alleles that are not >=90% similar to a reference allele and polishes reference alleles to obtain Amira allele (9d3e0e4)
  • initial attempts to correct all bubbles at once (cd5fbae)
  • initial attempts to correct all bubbles at once (15c40cd)
  • kind of working correction method (e79d6cf)
  • mulit-processed DFS of paths and adds minimizer comparison of paths for accurate correction (9718b80)
  • multiprocessed graph building using joblib (c6af7ac)
  • new approach to cluster reads based on paths through the graph (0c36248)
  • output approximate copy numbers and reference allele depths (fbe147b)
  • output tsv of amira results (cbe3c96)
  • output txt file of depths across reference alleles (5128c5c)
  • recover reads from paths that have been filtered out (ef58223)
  • report all equally close variants (d373b7e)
  • take fasta reference file as input and polish closest AMR gene to get nucleotide sequence (3255f28)
  • ten iterations of racon polishing (a81ddce)
  • tracks the position of each gene through each correction step to allow extraction of its sequence. Also removes redundant code and tests and improves code modularity. (5a21e91)
  • use allele sequences to polish pandora consensus instead of entire read (cd07418)
  • use as many adjacent nodes as possible to resolve complex AMR gene paths (ca677c5)
  • use overall mean node coverage for specific k to estimate copy number (ece0037)

Bug Fixes

  • accounts for racon runtime errors, prevents AMR gene bubbles being popped (d0be7f4)
  • add dependencies to pyproject.toml (9ebc9f3)
  • add minimal instructions to README (570e8af)
  • add pandas as dependency (5569a2e)
  • add sourmash as a dependency (edc19a0)
  • add sourmash as a dependency (11a4b4d)
  • apply allele filters to genes from short reads (e3fdc4f)
  • avoids undercalling genes due to collapsed paths (02787dd)
  • bug fix for getting the genes from a list of nodes (6114f75)
  • bug fix for missing gene due to improper sorting (3b79f5a)
  • bug in bubble popping that was missing bubbles (7b1de79)
  • correct relative import of scripts (9986fc8)
  • de novo check gene coverages with minimap2 and do not allow racon to trim AMR alleles (709e0b0)
  • edge case fix for assigning reads to paths and adds a component ID column to the amira tsv (83cc331)
  • edge case in AMR path finding (95889a8)
  • find genes missing from the graph (108d1df)
  • find genes missing from the graph (6713e4d)
  • fix a bunch of bugs that messed with evaluation in real data (a4e76c9)
  • fix for known bug where two different AMR paths overlap, tweaks params to filer out AMR genes and prevents k from being increased when the coverage is too low (70506d1)
  • fixes bug where AMR blocks are missed if they start or end at a junction (c5c0d46)
  • improper handling of edge cases where duplicate genes occur on a read we are trying to correct and/or in a path that we are trying to correct to. (6973ecb)
  • improves modularity of main function and allows JSON inputs (2ed1347)
  • improves modularity of main function and allows JSON inputs (02d15b0)
  • increase coverage threshold for filtering and allow replacement of AMR genes in low coverage paths (a80d980)
  • limit allele coverage to 100 (06ae3e2)
  • minor memory optimisation for minimizer extraction (3dde066)
  • missing AMR path (8e94b44)
  • missing AMR path (b0d64fd)
  • output empty dataframe when no AMR genes are present (6fef804)
  • prevent removal of full components and AMR nodes in dead ends (deea859)
  • reduce AMR allele coverage threshold to 85% (92cca4f)
  • relative import location of fastq functions (4e74456)
  • remove legacy script (3094671)
  • remove redundant print (d9908df)
  • remove threshold for initial AMR path finding (551e090)
  • return empty result when no AMR genes found (659fceb)
  • sort ref alleles first by length then by similarity (e056b56)
  • tweaks parameters to filter AMR genes and ensures subsampling reads does not remove AMR genes completely (e2ff182)
  • undefined variable fix and pre-commit reformatting (9e9c8e0)
  • update to latest pysam (beebe6d)
  • use reads to define start and stop points of paths through AMR blocks (88dcf51)

Styles

  • black reformatting (c2c2ed0)
  • black reformatting (3502cfc)
  • black refotmatting recommendations (b41c635)
  • fix all flake8 style recommendations (13f66f1)
  • flake8 formatting (bd28289)
  • flake8 formatting (0d477de)
  • flake8 formatting (436d5d9)
  • flake8 formatting recommendations (d7d37b8)
  • flake8 formatting recommendations (b02baec)
  • flake8 reformatting (0f9333c)
  • flake8 reformatting (06a72f2)
  • flake8 reformatting (c93b288)
  • flake8 reformatting (da83389)
  • isort and black recommended formatting (51853a6)
  • isort and black recommended formatting (92efeff)
  • partial typing (4abdcd4)
  • pre-commit formatting (c4f18d7)
  • pre-commit recommended formatting (8a097d9)
  • pre-commit reformat (0d6a1f2)
  • pre-commit reformatting (e188bb9)
  • pre-commit reformatting (3ae42d7)
  • reformatting recommendations (e8f2b57)
  • reformatting recommendations (e523293)
  • reformatting using black (451a5b4)
  • remove redundant commented code (cec5de5)
  • rename to Amira (3f4da1c)
  • rename to Amira (7a4a6f5)

Tests

  • add additional test files (f606121)
  • add more complexity in tests (00ef219)
  • add test files (f3f9a51)
  • add unittest for minimizer comparison (4c4934d)
  • add unittest for overlapping AMR blocks (2f69561)
  • remove redundant test (cc79dc0)
  • remove test graph (d308ee3)