To provide a genomic narrative that can be trusted, microbiology
laboratories need quality control (QC) metrics to accompany their
genomic pipelines. QC metrics enable:
• Implementing standards in routine lab sample processing
• Performance comparison of pipeline optimizations or alternatives
• Retrospective tracing of problems that arise
QC metrics are not easy to implement: they may need to be adjusted for
organism type, sample quality, sequencing technology and preparation,
and the mix of software components that are brought together in a
pipeline. Another challenge is to transform QC reporting from a manual
review of a pipeline's disparate and often opaque application log files
into an automated system of reporting and decision making that can be
adjusted by researchers and system administrators who are not expert
programmers.
We have developed a general-purpose text-mining and reporting
application called Report Calc for Quality Control (RCQC) that works
directly within command-line scripts, or as a tool in Galaxy (an interactive
bioinformatics platform and workflow engine). An RCQC interpreter
follows instructions in an RCQC script to extract QC variables from various
application log and report files. It can implement rules that trigger
warning or failure statuses in an active pipeline. Various opportunities
arise for metrics along the stages of a genomic pipeline; our initial focus
is on basic assembly metrics as illustrated on this poster.
Abstract
RCQC Recipes
QC Ontology
Using the JSON-LD format's metadata feature, RCQC can link particular
QC report terms to their standardized ontology counterparts. Creating a
controlled vocabulary for QC enables reports from disparate genomic
pipelines to be compared, which should eventually lead to a set of
pipeline metrics for accrediting commercial, government and open source
software. Within the context of the OBO Foundry of ontologies we are
introducing an ontology called GenEpiO (currently available at
https://github.com/Public-Health-Bioinformatics/irida_ontology) which
holds QC terms like "genome size ratio", "contig count", etc. Using the
Protégé ontology editor it is easy to see the definitions for these terms.
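The linking idea can be sketched with JSON-LD's `@context` key, which maps a report's short term names to IRIs. This is a minimal illustration only: the `example.org` IRIs, the `to_json_ld` helper, and the metric values are placeholders invented here, not actual GenEpiO identifiers or RCQC output.

```python
import json

# Hypothetical IRIs: stand-ins for ontology term identifiers, which are not
# given in this poster.
CONTEXT = {
    "genome_size_ratio": "http://example.org/qc/genome_size_ratio",
    "contig_count": "http://example.org/qc/contig_count",
}


def to_json_ld(metrics, context):
    """Wrap plain QC metrics in a JSON-LD document with an @context."""
    unknown = sorted(set(metrics) - set(context))
    if unknown:
        # Refuse to emit terms that have no ontology counterpart.
        raise KeyError("no ontology term for: " + ", ".join(unknown))
    document = {"@context": {name: context[name] for name in sorted(metrics)}}
    document.update(metrics)
    return document


# Illustrative values only.
report = to_json_ld({"genome_size_ratio": 0.97, "contig_count": 142}, CONTEXT)
print(json.dumps(report, indent=2))
```

Because every term in the report resolves to an IRI, two pipelines that use different local names can still be compared once both map to the same ontology term.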
Acknowledgements
IRIDA project funding is provided by Genome Canada, Genome BC, and
the Genomics R&D Initiative (GRDI) with additional support from Simon
Fraser University and Cystic Fibrosis Canada. We thank additional
project advisors for constructive comments.
We have started a library of simple "recipe" scripts that extract quality
control (QC) data from various reports like FastQC, QUAST, CheckM and
SPAdes into the popular and software-friendly JSON format (an
auto-generated HTML version of the same content is also available). One can
override sections of an RCQC recipe with settings that test variations in a
pipeline job. An example RCQC text-mining script and its output HTML and
JSON reports are shown below along with typical report files from other
pipeline tools.
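The extraction step a recipe performs can be approximated in plain Python as a set of regular expressions applied to a log file, with the captured values written out as JSON. This is a sketch under stated assumptions: it is not RCQC recipe syntax, and the log text, metric names, and patterns below are invented for illustration rather than copied from any real tool's output.

```python
import json
import re

# Hypothetical log text, invented for illustration; not the output of any
# real tool.
SAMPLE_LOG = """\
[info] reads processed: 120000
[info] reads combined: 96000
[info] percent combined: 80.00%
"""

# Each QC variable name maps to a regular expression with one capture group.
PATTERNS = {
    "reads_processed": r"reads processed:\s+(\d+)",
    "reads_combined": r"reads combined:\s+(\d+)",
    "percent_combined": r"percent combined:\s+([\d.]+)%",
}


def extract_metrics(log_text, patterns):
    """Return a dict of numeric QC values found in log_text."""
    metrics = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, log_text)
        if match is None:
            continue  # leave missing metrics out rather than guessing
        text = match.group(1)
        metrics[name] = float(text) if "." in text else int(text)
    return metrics


metrics = extract_metrics(SAMPLE_LOG, PATTERNS)
report_json = json.dumps({"qc": metrics}, indent=2, sort_keys=True)
print(report_json)
```

Keeping the patterns in a dictionary separate from the extraction function mirrors the recipe idea: a section of the table can be overridden for a pipeline variation without touching the code that applies it.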
1Department of Pathology, University of British Columbia; 2National Microbiology Laboratory, Public Health Agency of Canada; 3Department of Pathology,
University of British Columbia & BC Public Health Microbiology and Reference Laboratory
Damion M. Dooley1; Aaron J. Petkau2; Franklin Bristow2;
Gary Van Domselaar2; William W.L. Hsiao3
A Scripting Language For Standardized Evaluation Of Quality
Metrics In Galaxy And Command-line Driven Workflows
This work stemmed from the plan to enhance QC reporting on the
web-based Integrated Rapid Infectious Disease Analysis (www.IRIDA.ca)
project which manages sequence libraries and pipelines for food-borne
pathogen assembly, annotation, SNP detection, and phylogenetic
analysis. RCQC has been developed to work as a command-line Python
app, but in addition, since IRIDA uses Galaxy to execute its pipeline, we
have a Galaxy RCQC tool for advanced users to develop recipes. We will be
offering a basic version of this tool that allows users without programming
skills to adjust key QC parameters only.
Recipes can include conditionals that trigger a halt to a pipeline by
sending the appropriate signal (exit code). More than one RCQC recipe
can be run in a pipeline, and their report output can be daisy-chained in
order to contribute to a single collective report. QC metric conditionals
shown below can either signal a possible error situation (the fail(qc)
call), or even call a halt to futile pipeline work (via fail(job)).
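The two failure levels can be sketched in Python as rules carrying a severity, where only a job-level failure produces a non-zero exit code. This is an illustration, not RCQC's implementation: the rule table, the thresholds, and the exit code values are all assumptions made for this example.

```python
import operator

# Hypothetical rules and thresholds, invented for this sketch.
# Each rule is (metric, comparison, threshold, severity): "qc" records a
# warning-level failure, "job" asks the pipeline to stop.
RULES = [
    ("contig_count", operator.gt, 500, "qc"),
    ("genome_size_ratio", operator.lt, 0.5, "job"),
]

# Hypothetical exit codes; a wrapper script would pass the result to sys.exit().
EXIT_OK = 0
EXIT_JOB_FAILED = 2


def evaluate(metrics, rules):
    """Return (failures, exit_code) for the given metrics."""
    failures = []
    for metric, compare, threshold, severity in rules:
        value = metrics.get(metric)
        if value is not None and compare(value, threshold):
            failures.append(
                {"metric": metric, "value": value, "severity": severity}
            )
    job_failed = any(f["severity"] == "job" for f in failures)
    return failures, (EXIT_JOB_FAILED if job_failed else EXIT_OK)


# Illustrative values: many contigs, but a plausible genome size.
failures, code = evaluate(
    {"contig_count": 812, "genome_size_ratio": 0.93}, RULES
)
for failure in failures:
    print("QC {severity} failure: {metric} = {value}".format(**failure))
print("exit code:", code)
```

Separating the list of failures from the exit code lets a QC-level problem appear in the report while the pipeline keeps running, and reserves the halt for cases where further work would be wasted.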
adjusting parameters and formulae for pipeline operation, one that did
not require recompilation after each user-driven change. As a result, the
RCQC system provides a more transparent rule set that reduces the skill
needed to make process adjustments. Standard assembly pipeline QC
metrics are introduced which provide a blueprint for the way QC
components could be shared amongst NGS pipelines.
Further information, including source code, is available at
https://github.com/Public-Health-Bioinformatics/rcqc.
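One way to keep formulae adjustable without recompilation is to store them as data and look up the functions by name at run time, reusing Python's `operator` and `math` modules. The sketch below assumes a nested-list notation and a small function table made up for this example; it does not reproduce RCQC's actual script syntax or interpreter.

```python
import math
import operator

# Functions exposed by name; every entry comes from Python's standard library.
FUNCTIONS = {
    "add": operator.add,
    "sub": operator.sub,
    "mul": operator.mul,
    "truediv": operator.truediv,
    "log10": math.log10,
    "sqrt": math.sqrt,
}


def evaluate(expr, variables):
    """Evaluate a nested [function_name, arg, ...] list.

    Strings in argument position are looked up as variable names;
    numbers are returned unchanged.
    """
    if isinstance(expr, list):
        name, *args = expr
        return FUNCTIONS[name](*(evaluate(arg, variables) for arg in args))
    if isinstance(expr, str):
        return variables[expr]
    return expr


# Hypothetical formula stored as data, so it can be edited without
# changing code. Variable names and values are illustrative.
FORMULA = ["truediv", "assembly_size", "reference_size"]
ratio = evaluate(
    FORMULA, {"assembly_size": 4800000, "reference_size": 5000000}
)
print(ratio)
```

Because the formula is an ordinary list, it could be loaded from a text or JSON file that a researcher edits, while the function table limits what such a file is able to call.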
Implementation
Protégé ontology editor view of GenEpiO assembly quality control terms
[Figure panels: JSON-LD, HTML, FLASH, FastQC, CheckM]
RCQC recipe for text-mining flash.log
In developing a scripting language to
do this work, we did not want to
reinvent the wheel (in fact RCQC offers
up for reuse all of Python's built-in
math and operator functions). We did
however need a flexible mechanism for