ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
Regexes:
It's magic!
¡°Some people, when confronted with a problem,
think 'I know, I'll use regular expressions!'
Now they have two problems.¡±
Nerd talk: regexes
Nerd talk: regexes
*
Perl style regex:
It's magic done right!
Metacharacters
^ beginning
$ end
. anything
 escape

/^....G..AA$/
Escaped characters
s whitespace

/^wwwwGwwAA$/

S not-whitespace

/^dddddddd$/

w word
d digit
. dot
 counterslash
Repetition
? 0 or 1 time

/^w{4}Gw{2}AA$/

* 0 or more times

/^d{1,2}d{1,2}d{2,4}$/

+ 1 or more times
*? ungreedy *
+? ungreedy +
{m} m times
{m, n} m up to n times
{m, n}? ungreedy {m,n}
Grouping
[ABC] any of these
characters
(AB|BC|CA) any of
these expressions
(THIS!) save this
[A-Za-z0-9] ranges

/^[ACTG]{4}G[ACTG]
{2}AA$/
/^(0?[1-9]|[0-2]d|3[01])
(0?d|1[0-2])
(d{2}|d{4})$/
OVERKILL

http://nbviewer.ipython.org/url/norvig.com/ipython/xkcd1313.ipynb
In Python (sigh...)
E.g.: finding files
E.g.: finding files

iel'
v 'Dan
' | grep
bo
p -v 'bu
e
'->' | gr
ep
-la | gr
ls
E.g.: demultiplexing fasta
1. Barcode
2. Primer
3. Random nucleotides

grep -P '1:N:0:ACTGGTT' -A3 ¨Cno-group-separator
multiplex_R1.fastq | grep -P '^[ACTGN]
{4}CCC[ACGT]T[GC]AGATA' -A2 -B1 --no-group-separator >
deplexed_R1.fq
E.g.: paper figures!
From the subset of unique sequences that span the
entire region under study, how many unique
sequences are matched by each primer combination?
Sed: find & replace
¡°Are you gonna talk about
vim regexes?¡±
¡°Sed regexes are weird¡±
My work around:
use ranges
[0-9]
[A-Z]
[a-z]
[A-Za-z]
Sed: find & replace
¡°Are you gonna talk about
vim regexes?¡±
Sed regexes are weird¡±
My work around:
use ranges
[0-9]
[A-Z]
[a-z]
[A-Za-z]

E.g.:
¡°Oh noes, Americans don't know how to
separate decimals!¡±
sed 's/./,/g' hisfile.tab > myfile.tab
¡°Oh noes, this bloody file was edited in
Windows!¡±
sed 's/r/n/' theirfile.tab > decentfile.tab
¡°Oh noes, Cassava 1.6 has a slash in it!¡±
sed 's,/1, 1:N:0:NNNNNN,' oldfile.fq > newfile.fq
Other neat stuff
grep (-c)
sort (-n, -r, -k, -t)
uniq -c
LMGTFY:
sed
http://www.tutorialspoint.com/unix/unix-regular-expressions.htm
grep
http://linux.about.com/od/commands/l/blcmdl1_grep.htm
Perl
http://www.cs.tut.fi/~jkorpela/perl/regexp.html
Python
http://docs.python.org/2/howto/regex.html
Vim
http://vimregex.com/
sed 's/fear of regex/love of regex/g'

More Related Content

Similar to Nerd talk: regexes (9)

Latex Symbols.pdf
Latex Symbols.pdfLatex Symbols.pdf
Latex Symbols.pdf
Komal526846
?
symbols of Latex.pdf
symbols of Latex.pdfsymbols of Latex.pdf
symbols of Latex.pdf
MdNaim23
?
Ruby presentasjon p? NTNU 22 april 2009
Ruby presentasjon p? NTNU 22 april 2009Ruby presentasjon p? NTNU 22 april 2009
Ruby presentasjon p? NTNU 22 april 2009
Aslak Helles?y
?
Ruby presentasjon p? NTNU 22 april 2009
Ruby presentasjon p? NTNU 22 april 2009Ruby presentasjon p? NTNU 22 april 2009
Ruby presentasjon p? NTNU 22 april 2009
Aslak Helles?y
?
Ruby presentasjon p? NTNU 22 april 2009
Ruby presentasjon p? NTNU 22 april 2009Ruby presentasjon p? NTNU 22 april 2009
Ruby presentasjon p? NTNU 22 april 2009
Aslak Helles?y
?
Regular Expression
Regular ExpressionRegular Expression
Regular Expression
Lambert Lum
?
Regular Expressions
Regular ExpressionsRegular Expressions
Regular Expressions
Satya Narayana
?
Good Evils In Perl
Good Evils In PerlGood Evils In Perl
Good Evils In Perl
Kang-min Liu
?
Working with text, Regular expressions
Working with text, Regular expressionsWorking with text, Regular expressions
Working with text, Regular expressions
Krasimir Berov (§¬§â§Ñ§ã§Ú§Þ§Ú§â §¢§Ö§â§à§Ó)
?

Recently uploaded (20)

Benefits of Moving Ellucian Banner to Oracle Cloud
Benefits of Moving Ellucian Banner to Oracle CloudBenefits of Moving Ellucian Banner to Oracle Cloud
Benefits of Moving Ellucian Banner to Oracle Cloud
AstuteBusiness
?
Dragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN NB-IoT LTE cat.M1ÉÌÆ·¥ê¥¹¥È
Dragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN  NB-IoT  LTE cat.M1ÉÌÆ·¥ê¥¹¥ÈDragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN  NB-IoT  LTE cat.M1ÉÌÆ·¥ê¥¹¥È
Dragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN NB-IoT LTE cat.M1ÉÌÆ·¥ê¥¹¥È
CRI Japan, Inc.
?
Presentation Session 2 -Context Grounding.pdf
Presentation Session 2 -Context Grounding.pdfPresentation Session 2 -Context Grounding.pdf
Presentation Session 2 -Context Grounding.pdf
Mukesh Kala
?
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service SuperpowersWhen Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
Eric D. Schabell
?
HHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptx
HHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptxHHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptx
HHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptx
HampshireHUG
?
Recruiting Tech: A Look at Why AI is Actually OG
Recruiting Tech: A Look at Why AI is Actually OGRecruiting Tech: A Look at Why AI is Actually OG
Recruiting Tech: A Look at Why AI is Actually OG
Matt Charney
?
The Rise of AI Agents-From Automation to Autonomous Technology
The Rise of AI Agents-From Automation to Autonomous TechnologyThe Rise of AI Agents-From Automation to Autonomous Technology
The Rise of AI Agents-From Automation to Autonomous Technology
Impelsys Inc.
?
Generative AI & Telco track at AMLD 2024
Generative AI & Telco track at AMLD 2024Generative AI & Telco track at AMLD 2024
Generative AI & Telco track at AMLD 2024
Laurent Ciavaglia
?
How AWS Encryption Key Options Impact Your Security and Compliance
How AWS Encryption Key Options Impact Your Security and ComplianceHow AWS Encryption Key Options Impact Your Security and Compliance
How AWS Encryption Key Options Impact Your Security and Compliance
Chris Bingham
?
202408_JAWSPANKRATION_Introduction_of_Minaden.pdf
202408_JAWSPANKRATION_Introduction_of_Minaden.pdf202408_JAWSPANKRATION_Introduction_of_Minaden.pdf
202408_JAWSPANKRATION_Introduction_of_Minaden.pdf
NTTDOCOMO-ServiceInnovation
?
Automated Engineering of Domain-Specific Metamorphic Testing Environments
Automated Engineering of Domain-Specific Metamorphic Testing EnvironmentsAutomated Engineering of Domain-Specific Metamorphic Testing Environments
Automated Engineering of Domain-Specific Metamorphic Testing Environments
Pablo G¨®mez Abajo
?
GDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AI
GDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AIGDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AI
GDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AI
James Anderson
?
The Future is Here ¨C Learn How to Get Started! Ionic App Development
The Future is Here ¨C Learn How to Get Started! Ionic App DevelopmentThe Future is Here ¨C Learn How to Get Started! Ionic App Development
The Future is Here ¨C Learn How to Get Started! Ionic App Development
7Pillars
?
How Air Coil Inductors Work By Cet Technology
How Air Coil Inductors Work By Cet TechnologyHow Air Coil Inductors Work By Cet Technology
How Air Coil Inductors Work By Cet Technology
CET Technology
?
Java on AWS Without the Headaches - Fast Builds, Cheap Deploys, No Kubernetes
Java on AWS Without the Headaches - Fast Builds, Cheap Deploys, No KubernetesJava on AWS Without the Headaches - Fast Builds, Cheap Deploys, No Kubernetes
Java on AWS Without the Headaches - Fast Builds, Cheap Deploys, No Kubernetes
VictorSzoltysek
?
Graphs & GraphRAG - Essential Ingredients for GenAI
Graphs & GraphRAG - Essential Ingredients for GenAIGraphs & GraphRAG - Essential Ingredients for GenAI
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j
?
Windows Client Privilege Escalation-Shared.pptx
Windows Client Privilege Escalation-Shared.pptxWindows Client Privilege Escalation-Shared.pptx
Windows Client Privilege Escalation-Shared.pptx
Oddvar Moe
?
UiPath Agentic automation with Autopilot for everyone + new features/releases
UiPath Agentic  automation with Autopilot for everyone + new features/releasesUiPath Agentic  automation with Autopilot for everyone + new features/releases
UiPath Agentic automation with Autopilot for everyone + new features/releases
DianaGray10
?
Columbia Weather Systems - Product Overview
Columbia Weather Systems - Product OverviewColumbia Weather Systems - Product Overview
Columbia Weather Systems - Product Overview
Columbia Weather Systems
?
ºÝºÝߣs from Perth MuleSoft Meetup March 2025
ºÝºÝߣs from Perth MuleSoft Meetup March 2025ºÝºÝߣs from Perth MuleSoft Meetup March 2025
ºÝºÝߣs from Perth MuleSoft Meetup March 2025
Michael Price
?
Benefits of Moving Ellucian Banner to Oracle Cloud
Benefits of Moving Ellucian Banner to Oracle CloudBenefits of Moving Ellucian Banner to Oracle Cloud
Benefits of Moving Ellucian Banner to Oracle Cloud
AstuteBusiness
?
Dragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN NB-IoT LTE cat.M1ÉÌÆ·¥ê¥¹¥È
Dragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN  NB-IoT  LTE cat.M1ÉÌÆ·¥ê¥¹¥ÈDragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN  NB-IoT  LTE cat.M1ÉÌÆ·¥ê¥¹¥È
Dragino¥×¥í¥À¥¯¥È¥«¥¿¥í¥° LoRaWAN NB-IoT LTE cat.M1ÉÌÆ·¥ê¥¹¥È
CRI Japan, Inc.
?
Presentation Session 2 -Context Grounding.pdf
Presentation Session 2 -Context Grounding.pdfPresentation Session 2 -Context Grounding.pdf
Presentation Session 2 -Context Grounding.pdf
Mukesh Kala
?
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service SuperpowersWhen Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
When Platform Engineers meet SREs - The Birth of O11y-as-a-Service Superpowers
Eric D. Schabell
?
HHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptx
HHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptxHHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptx
HHUG-04-2025-Close-more-deals-from-your-existing-pipeline-FOR SLIDESHARE.pptx
HampshireHUG
?
Recruiting Tech: A Look at Why AI is Actually OG
Recruiting Tech: A Look at Why AI is Actually OGRecruiting Tech: A Look at Why AI is Actually OG
Recruiting Tech: A Look at Why AI is Actually OG
Matt Charney
?
The Rise of AI Agents-From Automation to Autonomous Technology
The Rise of AI Agents-From Automation to Autonomous TechnologyThe Rise of AI Agents-From Automation to Autonomous Technology
The Rise of AI Agents-From Automation to Autonomous Technology
Impelsys Inc.
?
Generative AI & Telco track at AMLD 2024
Generative AI & Telco track at AMLD 2024Generative AI & Telco track at AMLD 2024
Generative AI & Telco track at AMLD 2024
Laurent Ciavaglia
?
How AWS Encryption Key Options Impact Your Security and Compliance
How AWS Encryption Key Options Impact Your Security and ComplianceHow AWS Encryption Key Options Impact Your Security and Compliance
How AWS Encryption Key Options Impact Your Security and Compliance
Chris Bingham
?
Automated Engineering of Domain-Specific Metamorphic Testing Environments
Automated Engineering of Domain-Specific Metamorphic Testing EnvironmentsAutomated Engineering of Domain-Specific Metamorphic Testing Environments
Automated Engineering of Domain-Specific Metamorphic Testing Environments
Pablo G¨®mez Abajo
?
GDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AI
GDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AIGDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AI
GDG Cloud Southlake #41: Shay Levi: Beyond the Hype:How Enterprises Are Using AI
James Anderson
?
The Future is Here ¨C Learn How to Get Started! Ionic App Development
The Future is Here ¨C Learn How to Get Started! Ionic App DevelopmentThe Future is Here ¨C Learn How to Get Started! Ionic App Development
The Future is Here ¨C Learn How to Get Started! Ionic App Development
7Pillars
?
How Air Coil Inductors Work By Cet Technology
How Air Coil Inductors Work By Cet TechnologyHow Air Coil Inductors Work By Cet Technology
How Air Coil Inductors Work By Cet Technology
CET Technology
?
Java on AWS Without the Headaches - Fast Builds, Cheap Deploys, No Kubernetes
Java on AWS Without the Headaches - Fast Builds, Cheap Deploys, No KubernetesJava on AWS Without the Headaches - Fast Builds, Cheap Deploys, No Kubernetes
Java on AWS Without the Headaches - Fast Builds, Cheap Deploys, No Kubernetes
VictorSzoltysek
?
Graphs & GraphRAG - Essential Ingredients for GenAI
Graphs & GraphRAG - Essential Ingredients for GenAIGraphs & GraphRAG - Essential Ingredients for GenAI
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j
?
Windows Client Privilege Escalation-Shared.pptx
Windows Client Privilege Escalation-Shared.pptxWindows Client Privilege Escalation-Shared.pptx
Windows Client Privilege Escalation-Shared.pptx
Oddvar Moe
?
UiPath Agentic automation with Autopilot for everyone + new features/releases
UiPath Agentic  automation with Autopilot for everyone + new features/releasesUiPath Agentic  automation with Autopilot for everyone + new features/releases
UiPath Agentic automation with Autopilot for everyone + new features/releases
DianaGray10
?
ºÝºÝߣs from Perth MuleSoft Meetup March 2025
ºÝºÝߣs from Perth MuleSoft Meetup March 2025ºÝºÝߣs from Perth MuleSoft Meetup March 2025
ºÝºÝߣs from Perth MuleSoft Meetup March 2025
Michael Price
?

Nerd talk: regexes