際際滷

際際滷Share a Scribd company logo
TRAP STATUS UPDATE
TRAnsients Pipeline
Gijs Molenaar
gijs@pythonic.nl
@gijzelaerr
Thursday, July 11, 13
ABOUTTRAP
 TRAnsients Pipeline
 Detect and classify transients in multi-frequency radio sky
image time series
 Emit VOevents
 99% Python
Thursday, July 11, 13
STEPS
Thursday, July 11, 13
A LOT HAPPENED
 Version 1.0 imminent
 Focused on code quality and performance
 No big new science features
Thursday, July 11, 13
PERFORMANCE
 A lot faster
 Really a lot faster
 0.85 image per second
per core
 Scales well
minutes
Thursday, July 11, 13
RSM CYCLE0 RUN0
 3402 images
 processing record - 5:21 min
 2 machines, 36 cores
 5645 unique sources
 667 detected transients
 previous version: 400 min on
40 cores
Thursday, July 11, 13
TRAP & AARTFAAC
 AARTFAAC
 48 images/s
 57 (real) cores required
 1 or 2 big fat systems will do!
Thursday, July 11, 13
INSTALLABILITY
 Merged TKP into TRAP
 Almost open source
 Easy database setup
 Remove many dependencies
 Like Lofar System Software (closed source)
Thursday, July 11, 13
QUALITY CONTROL
 Automated rejection of
bad images
 Known bright source in FOV
 RMS x times higher than
theoretical noise
 oversampled / undersampled /
highly elliptical
Thursday, July 11, 13
STORAGE
 Added support for PostgreSQL
 fast with small datasets
 Many off-the-shelf tools available
Thursday, July 11, 13
UNDERTHE HOOD
 Switched to celery
 asynchronous job queue
 based on distributed
message passing
 No more cuisine
Thursday, July 11, 13
WHY CELERY
 Easier to use / install / debug
 Faster - hot processes
 Many off-the-shelf tools
 CEP1 compatible
 Easy to add compute nodes
Thursday, July 11, 13
Thursday, July 11, 13
DISCO?
 Maybe add support for
Disco in the future
 Similar
 Map - Reduce
 Hadoop for Python
 Distributed 鍖le system
Thursday, July 11, 13
USABILITY
 tkp-manage.py
 Pipeline management tool
 Inspired by Django
manage.py command
 Easy to
 setup pipeline
 add and run jobs
 run celery workers
 Add new commands
Thursday, July 11, 13
DEMO?
Thursday, July 11, 13
SUPPORTEDTELESCOPES
 Support for FITS and CASA tables
 鍖eld parsers for LOFAR
 Possible to add telescope speci鍖c 鍖eld parsing and
quality checks
 ThunderKAT next week
Thursday, July 11, 13
PROJECT CLEANUP
 removed 40% of code
 80% unit tested
 Added jenkins build server
 Performance regression tests
 Pull request/review work 鍖ow
 hipchat for central communication
Thursday, July 11, 13
WEB INTERFACE BANANA
 New web interface
 Rewrite of TKP-web
 Future ready
 Scientist friendly
Thursday, July 11, 13
Thursday, July 11, 13
DEMO?
Thursday, July 11, 13
FUTURE WORK
 More stable releases
 Add support for non-LOFAR data
 More quality checks
 Source storage and association
performance
 Distributed 鍖le system
 Automated classi鍖cation
 Web based data exploration
Thursday, July 11, 13
QUESTIONS
gijs@pythonic.nl
@gijzelaerr
Thursday, July 11, 13

More Related Content

What's hot (20)

ODP
Managing your SaltStack Minions with Foreman
Stephen Benjamin
PPTX
Eac integrations JS LiveStream
ChronoLogic
PPT
SaltConf14 - Brendan Burns, Google - Management at Google Scale
SaltStack
PDF
Connecting AWS and Katello/The Foreman
Lukas Kallies
PDF
Stream Processing with Apache Flink
C4Media
PDF
Reactive programming with Rxjava
Christophe Marchal
PDF
Kafka Summit NYC 2017 - The Source of Truth: Why the New York Times Stores Ev...
confluent
PPTX
Portable Streaming Pipelines with Apache Beam
confluent
PDF
How static analysis supports quality over 50 million lines of C++ code
cppfrug
PDF
Monitorama 2015 Netflix Instance Analysis
Brendan Gregg
PDF
Infrastructure & System Monitoring using Prometheus
Marco Pas
DOCX
Addressing data plane performance measurement on OpenStack clouds using VMTP
Suhail Syed
PDF
Puppet Camp Chicago 2014: Running Multiple Puppet Masters (Beginner)
Puppet
PPTX
Integrating Bare-metal Provisioning into CERN's Private Cloud
Arne Wiebalck
PPTX
Operational War Stories from 5 Years of Running OpenStack in Production
Arne Wiebalck
PDF
Salt Air 19 - Intro to SaltStack RAET (reliable asyncronous event transport)
SaltStack
PDF
OSMC 2021 | Robotmk: You dont run IT you deliver services!
NETWAYS
ODP
Lightcycle
Nathaniel Eliot
PPTX
Training Going Async
Betclic Everest Group Tech Team
PDF
Flink Forward SF 2017: Scott Kidder - Building a Real-Time Anomaly-Detection ...
Flink Forward
Managing your SaltStack Minions with Foreman
Stephen Benjamin
Eac integrations JS LiveStream
ChronoLogic
SaltConf14 - Brendan Burns, Google - Management at Google Scale
SaltStack
Connecting AWS and Katello/The Foreman
Lukas Kallies
Stream Processing with Apache Flink
C4Media
Reactive programming with Rxjava
Christophe Marchal
Kafka Summit NYC 2017 - The Source of Truth: Why the New York Times Stores Ev...
confluent
Portable Streaming Pipelines with Apache Beam
confluent
How static analysis supports quality over 50 million lines of C++ code
cppfrug
Monitorama 2015 Netflix Instance Analysis
Brendan Gregg
Infrastructure & System Monitoring using Prometheus
Marco Pas
Addressing data plane performance measurement on OpenStack clouds using VMTP
Suhail Syed
Puppet Camp Chicago 2014: Running Multiple Puppet Masters (Beginner)
Puppet
Integrating Bare-metal Provisioning into CERN's Private Cloud
Arne Wiebalck
Operational War Stories from 5 Years of Running OpenStack in Production
Arne Wiebalck
Salt Air 19 - Intro to SaltStack RAET (reliable asyncronous event transport)
SaltStack
OSMC 2021 | Robotmk: You dont run IT you deliver services!
NETWAYS
Lightcycle
Nathaniel Eliot
Training Going Async
Betclic Everest Group Tech Team
Flink Forward SF 2017: Scott Kidder - Building a Real-Time Anomaly-Detection ...
Flink Forward

Viewers also liked (9)

PPTX
仆舒亞舒亟 舒亰舒从 丐..
舒亳仆舒 弌亠仗舒仆磻
PPT
仄亳 于仍亠亠仆亳亶 亠仄亳 从舒亰舒从
舒亳仆舒 弌亠仗舒仆磻
PPTX
亳亰亟舒仆亳
舒亳仆舒 弌亠仗舒仆磻
ODP
Pygrunn 2013
Gijs Molenaar
PPTX
仆仆舒 于舒仆仂于仆舒
舒亳仆舒 弌亠仗舒仆磻
DOCX
Give back
Marc Garofalo
PPT
仗舒仗舒, 仄舒仄舒, 仗亠亟舒亞仂亞亳亠从舒 亠仄 (从舒亰舒从)
舒亳仆舒 弌亠仗舒仆磻
PPTX
WHO yuta aikawa
Yuta Aikawa
PPTX
仗亠亰亠仆舒亳
舒亳仆舒 弌亠仗舒仆磻
仆舒亞舒亟 舒亰舒从 丐..
舒亳仆舒 弌亠仗舒仆磻
仄亳 于仍亠亠仆亳亶 亠仄亳 从舒亰舒从
舒亳仆舒 弌亠仗舒仆磻
亳亰亟舒仆亳
舒亳仆舒 弌亠仗舒仆磻
Pygrunn 2013
Gijs Molenaar
仆仆舒 于舒仆仂于仆舒
舒亳仆舒 弌亠仗舒仆磻
Give back
Marc Garofalo
仗舒仗舒, 仄舒仄舒, 仗亠亟舒亞仂亞亳亠从舒 亠仄 (从舒亰舒从)
舒亳仆舒 弌亠仗舒仆磻
WHO yuta aikawa
Yuta Aikawa
仗亠亰亠仆舒亳
舒亳仆舒 弌亠仗舒仆磻
Ad

Recently uploaded (20)

PPTX
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Pitch ...
Michele Kryston
PDF
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
PDF
ArcGIS Utility Network Migration - The Hunter Water Story
Safe Software
PPTX
Simplifica la seguridad en la nube y la detecci坦n de amenazas con FortiCNAPP
Cristian Garcia G.
PDF
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
PDF
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
PDF
LLM Search Readiness Audit - Dentsu x SEO Square - June 2025.pdf
Nick Samuel
PPTX
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
PDF
The Growing Value and Application of FME & GenAI
Safe Software
PDF
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
yosra Saidani
PDF
Scaling i.MX Applications Processors Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
PDF
From Chatbot to Destroyer of Endpoints - Can ChatGPT Automate EDR Bypasses (1...
Priyanka Aash
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
PPTX
叶Wondershare Filmora Crack 14.0.7 + Key Download 2025
sebastian aliya
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
DOCX
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
PDF
Open Source Milvus Vector Database v 2.6
Zilliz
PDF
The Future of Product Management in AI ERA.pdf
Alyona Owens
PPTX
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
PPTX
reInforce 2025 Lightning Talk - Scott Francis.pptx
ScottFrancis51
MARTSIA: A Tool for Confidential Data Exchange via Public Blockchain - Pitch ...
Michele Kryston
Hello I'm "AI" Your New _________________
Dr. Tathagat Varma
ArcGIS Utility Network Migration - The Hunter Water Story
Safe Software
Simplifica la seguridad en la nube y la detecci坦n de amenazas con FortiCNAPP
Cristian Garcia G.
Quantum AI Discoveries: Fractal Patterns Consciousness and Cyclical Universes
Saikat Basu
EIS-Webinar-Engineering-Retail-Infrastructure-06-16-2025.pdf
Earley Information Science
LLM Search Readiness Audit - Dentsu x SEO Square - June 2025.pdf
Nick Samuel
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
The Growing Value and Application of FME & GenAI
Safe Software
Salesforce Summer '25 Release Frenchgathering.pptx.pdf
yosra Saidani
Scaling i.MX Applications Processors Native Edge AI with Discrete AI Accele...
Edge AI and Vision Alliance
From Chatbot to Destroyer of Endpoints - Can ChatGPT Automate EDR Bypasses (1...
Priyanka Aash
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
叶Wondershare Filmora Crack 14.0.7 + Key Download 2025
sebastian aliya
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
Daily Lesson Log MATATAG ICT TEchnology 8
LOIDAALMAZAN3
Open Source Milvus Vector Database v 2.6
Zilliz
The Future of Product Management in AI ERA.pdf
Alyona Owens
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
reInforce 2025 Lightning Talk - Scott Francis.pptx
ScottFrancis51
Ad

TRAP (transient detection pipeline) status update

  • 1. TRAP STATUS UPDATE TRAnsients Pipeline Gijs Molenaar gijs@pythonic.nl @gijzelaerr Thursday, July 11, 13
  • 2. ABOUTTRAP TRAnsients Pipeline Detect and classify transients in multi-frequency radio sky image time series Emit VOevents 99% Python Thursday, July 11, 13
  • 4. A LOT HAPPENED Version 1.0 imminent Focused on code quality and performance No big new science features Thursday, July 11, 13
  • 5. PERFORMANCE A lot faster Really a lot faster 0.85 image per second per core Scales well minutes Thursday, July 11, 13
  • 6. RSM CYCLE0 RUN0 3402 images processing record - 5:21 min 2 machines, 36 cores 5645 unique sources 667 detected transients previous version: 400 min on 40 cores Thursday, July 11, 13
  • 7. TRAP & AARTFAAC AARTFAAC 48 images/s 57 (real) cores required 1 or 2 big fat systems will do! Thursday, July 11, 13
  • 8. INSTALLABILITY Merged TKP into TRAP Almost open source Easy database setup Remove many dependencies Like Lofar System Software (closed source) Thursday, July 11, 13
  • 9. QUALITY CONTROL Automated rejection of bad images Known bright source in FOV RMS x times higher than theoretical noise oversampled / undersampled / highly elliptical Thursday, July 11, 13
  • 10. STORAGE Added support for PostgreSQL fast with small datasets Many off-the-shelf tools available Thursday, July 11, 13
  • 11. UNDERTHE HOOD Switched to celery asynchronous job queue based on distributed message passing No more cuisine Thursday, July 11, 13
  • 12. WHY CELERY Easier to use / install / debug Faster - hot processes Many off-the-shelf tools CEP1 compatible Easy to add compute nodes Thursday, July 11, 13
  • 14. DISCO? Maybe add support for Disco in the future Similar Map - Reduce Hadoop for Python Distributed 鍖le system Thursday, July 11, 13
  • 15. USABILITY tkp-manage.py Pipeline management tool Inspired by Django manage.py command Easy to setup pipeline add and run jobs run celery workers Add new commands Thursday, July 11, 13
  • 17. SUPPORTEDTELESCOPES Support for FITS and CASA tables 鍖eld parsers for LOFAR Possible to add telescope speci鍖c 鍖eld parsing and quality checks ThunderKAT next week Thursday, July 11, 13
  • 18. PROJECT CLEANUP removed 40% of code 80% unit tested Added jenkins build server Performance regression tests Pull request/review work 鍖ow hipchat for central communication Thursday, July 11, 13
  • 19. WEB INTERFACE BANANA New web interface Rewrite of TKP-web Future ready Scientist friendly Thursday, July 11, 13
  • 22. FUTURE WORK More stable releases Add support for non-LOFAR data More quality checks Source storage and association performance Distributed 鍖le system Automated classi鍖cation Web based data exploration Thursday, July 11, 13