際際滷

際際滷Share a Scribd company logo
JPEG 2000 at the Wellcome
Library
Christy Henshaw
Digitisation Programme Manager  Wellcome Library
JP2 Summit
12-13 May 2011
Library of Congress
The Wellcome Trust
A global charitable foundation
Achieving extraordinary improvements in human and animal health
Supporting the brightest minds in biomedical research and the
medical humanities
Exploring medicine in historical and cultural contexts
The Wellcome Library
The Wellcome Library
Collections of books, manuscripts, archives, films and pictures on the
history of medicine from the earliest times to the present day .
The Wellcome Digital Library pilot,
2010-2013
Genetics and its Modern Foundations
A new online resource for everyone interested in the history of
human and animal health.

Aims
 build sustainable/expandable mechanism  foundation stone
for WDL
 digitise key library holdings - relating to a major Trust
challenge area
 digitise important third party content  linked to theme
 use innovative content and tools  to encourage discovery and
use
 explore commercial partnerships  enhance access to nontheme material
JPEG 2000 conversion  scope
Wellcome Images  image library, legacy images, 300,000
images in the archive
Current projects  pilot digitisation projects, 7m images 2010 2014

 Long-term plans  digitisation of large proportion of our
collections (mainly special collections), 15m  25m images 2014
and beyond
Type of content
Printed books  early printed books, modern books
(monographs), pamphlets, reports
Archives  personal papers, institutional papers, unpublished
works, mostly 20th century
 Manuscripts  unpublished, handwritten manuscript books and
related materials, mostly 17th, 18th and 19th century, can be fragile

 Artworks  prints, paintings, posters, drawings, glass slides, etc.
The Francis Crick Archive
Books related to genetic research
Early printed books
Artworks, manuscripts
Decision to adopt JP2
JPEG 2000 was found to answer the following needs:
Storage costs 20/30m TIFFs stored on online, backed-up
storage = multiple petabytes. Needed something cost-effective.
Quality  needed a high-quality compressed format that would
cover a wide range of content types.
 Robustness  needed a well-established image format with a
high chance of long-term support.
 Practical  feasible to use in a Library digitisation workflow.
Finding our way
Working with JP2 opened up a whole new world  reading
specifications, finding conversion software, so many choices.

Commissioned the report:
JPEG 2000 as a Preservation and Access Format for the Wellcome Trust Libr
Goal to find a single version of JPEG 2000 that would meet the
needs of both long-term preservation and flexible delivery needs.
The result
Parameter

Settings

File format

Part 1 (.jp2)

Compression

Lossy (6:1, 10:1)

Tiling

1024 x 1024

Progression order

RLCP

Decomp levels

5

Quality layers

8

Code block size

6, 64x64

Regions of interest

No

TLM markers

Yes

Bypass

N/A
Embedding JP2
Chose LuraWave command line tool
 Some issues (bugs, or inconvenient implementations) arose, and
all have been successfully addressed by LuraTech
 Created a firm consensus to use JP2 as the format for all stillimage digital imaging (with one or two exceptions)
 No plans to use JP2 for digital video  but never say never
 Internal information sharing  digital archivists, systems
administrators, IT department, programme board members
 External communication and networking
Current status, future plans
 Conversion of all new digital images is now carried out as
standard
 Nearing the final stages of a project to convert 450k image
backlog to JP2 (reducing current footprint from 20 Tb to 5.5 Tb)
 Large projects use lossy JP2, legacy picture library uses lossless
 Developed a strategy to determine compression levels
 Currently using the GUI, but will use the command line interface
with our new workflow system, streamlining conversion and QA
 Medium term, will look at automating compression level selection
Quality control for compression
 Visual inspection
 Color shifts, loss of detail, halo effects, pixelation, blurring, etc.
 Collection-based, representative sample
 Test range of compressions with intervals such as 2:1, 4:1, 6:1
 Once artefacts are discovered, step back to previous
compression ratio
 Worst-performing image rules, for any particular collection
 Efficient for homogenous collections  less so for heterogenous
collections with wide variety of content
 Archives particularly difficult  black and white compresses very
well  colour drawings and photographs, not so well
Establishing the JP2K-UK group
 Unknown who in the UK were using JPEG 2000, or considering it
 Unknown who was even interested in JPEG 2000
 No one wants to work in a vacuum
 Discovered a high level of interest: British Library, The National
Archives, Oxford, Kings College London, Cambridge and
Southampton Universities, Digital Preservation Coalition,
commercial companies/consultants
 Loose affiliation of the like-minded  a user group
Remit of the JP2K-UK group
 Initial meeting in December 2009
 Everyone had a little knowledge  no one knew enough
 Agreed the need to approach JP2 implementation from
practitioners point of view
 Practitioner meaning those who manage digital imaging
strategies and implementation
 Agreed need to share information and collaborate
 Discussed ideas for a conference, and creating some guidelines
for the user community
 Wellcome encouraged to write a blog about specific experiences
working with JP2
Ouputs
 JPEG 2000 Seminar, held in London in November 2010
> 80 attendees
> UK and European speakers and delegates
> mostly non-technical audience
 Advocacy for practitioners needs
> discussing and airing the needs and concerns of
practitioners has influenced software developers, and even the
JPEG Committee
> JPEG

2000 at the Wellcome Library blog
www.jpeg2000wellcomelibrary.blogspot.com
Future plans for JP2K-UK
 Guidance for practitioners
> Human readable
> Focus on practicalities
> Enable practitioners to make informed choices
> Advice on implementation
 Community building
> Case studies
> Lessons learned
> Networking (nationally and internationally)

More Related Content

Similar to Jpeg2000 at Wellcome Library (20)

Newman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property ManagementNewman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property Management
Alan Newman
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2
Johannes Phaladi
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2
Johannes Phaladi
Of Communities and Practices: Digital Preservation Innovation & Research
Of Communities  and Practices: Digital Preservation Innovation & ResearchOf Communities  and Practices: Digital Preservation Innovation & Research
Of Communities and Practices: Digital Preservation Innovation & Research
Erwin Verbruggen
Digitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital ProgramDigitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital Program
Robert Frech
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
FIAT/IFTA
Ariadne overview
Ariadne overviewAriadne overview
Ariadne overview
ariadnenetwork
Evolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of MedicineEvolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of Medicine
John Rees
20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journals20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journals
Neil Beagrie
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
Jenny Mitcham
Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]
Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]
Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]
Frederick Zarndt
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and Ceremony
Archiver
Research in the digital age - circa 2005
Research in the digital age - circa 2005Research in the digital age - circa 2005
Research in the digital age - circa 2005
Larry Naukam
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
ASIS&T
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)
EDINA, University of Edinburgh
Constructing bottomup
Constructing bottomupConstructing bottomup
Constructing bottomup
Alex Hardisty
SCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/BelgiumSCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/Belgium
Sven Schlarb
Digitising Hansard
Digitising HansardDigitising Hansard
Digitising Hansard
ALISS
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica
Jenny Mitcham
Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...
UCD Library
Newman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property ManagementNewman, DAM + Image Intellectual Property Management
Newman, DAM + Image Intellectual Property Management
Alan Newman
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2
Johannes Phaladi
Wordofa presentation icadla2
Wordofa presentation icadla2Wordofa presentation icadla2
Wordofa presentation icadla2
Johannes Phaladi
Of Communities and Practices: Digital Preservation Innovation & Research
Of Communities  and Practices: Digital Preservation Innovation & ResearchOf Communities  and Practices: Digital Preservation Innovation & Research
Of Communities and Practices: Digital Preservation Innovation & Research
Erwin Verbruggen
Digitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital ProgramDigitizing Spectator - Libraries Digital Program
Digitizing Spectator - Libraries Digital Program
Robert Frech
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
FIAT/IFTA
Evolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of MedicineEvolution of motion picture digitization at the National Library of Medicine
Evolution of motion picture digitization at the National Library of Medicine
John Rees
20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journals20yrs: 2004 iPRES Beijing e-journals
20yrs: 2004 iPRES Beijing e-journals
Neil Beagrie
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
Jenny Mitcham
Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]
Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]
Digital projects best practices [xxxiii reuni坦n nacional de archivos 201111]
Frederick Zarndt
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and Ceremony
Archiver
Research in the digital age - circa 2005
Research in the digital age - circa 2005Research in the digital age - circa 2005
Research in the digital age - circa 2005
Larry Naukam
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
ASIS&T
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)
EDINA, University of Edinburgh
Constructing bottomup
Constructing bottomupConstructing bottomup
Constructing bottomup
Alex Hardisty
SCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/BelgiumSCAPE Presentation at the Elag2013 conference in Gent/Belgium
SCAPE Presentation at the Elag2013 conference in Gent/Belgium
Sven Schlarb
Digitising Hansard
Digitising HansardDigitising Hansard
Digitising Hansard
ALISS
"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica"Filling the digital preservation gap" with Archivematica
"Filling the digital preservation gap" with Archivematica
Jenny Mitcham
Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...Resource description and new media : challenges and opportunities. Authors: E...
Resource description and new media : challenges and opportunities. Authors: E...
UCD Library

More from Wellcome Library (12)

ProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner Perspective
Wellcome Library
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9
Wellcome Library
Doing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisationDoing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisation
Wellcome Library
Systems and Processes: making order out of chaos
Systems and Processes: making order out of chaosSystems and Processes: making order out of chaos
Systems and Processes: making order out of chaos
Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome LibraryCopyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome Library
Wellcome Library
Systems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling offSystems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling off
Wellcome Library
Digitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryDigitisation Projects at Wellcome Library
Digitisation Projects at Wellcome Library
Wellcome Library
How will history remember you?
How will history remember you?How will history remember you?
How will history remember you?
Wellcome Library
Image Capture
Image CaptureImage Capture
Image Capture
Wellcome Library
Conservation for Digitisation
Conservation for DigitisationConservation for Digitisation
Conservation for Digitisation
Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryCopyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Wellcome Library
Mandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustMandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome Trust
Wellcome Library
ProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner PerspectiveProQuest Early European Books: Partner Perspective
ProQuest Early European Books: Partner Perspective
Wellcome Library
Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9Wt dnt digitisation_open_day_v9
Wt dnt digitisation_open_day_v9
Wellcome Library
Doing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisationDoing Projects: 10 laws of digitisation
Doing Projects: 10 laws of digitisation
Wellcome Library
Systems and Processes: making order out of chaos
Systems and Processes: making order out of chaosSystems and Processes: making order out of chaos
Systems and Processes: making order out of chaos
Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome LibraryCopyright clearance for genetics books - a pilot project at the Wellcome Library
Copyright clearance for genetics books - a pilot project at the Wellcome Library
Wellcome Library
Systems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling offSystems, processes & how we stop the wheels falling off
Systems, processes & how we stop the wheels falling off
Wellcome Library
Digitisation Projects at Wellcome Library
Digitisation Projects at Wellcome LibraryDigitisation Projects at Wellcome Library
Digitisation Projects at Wellcome Library
Wellcome Library
How will history remember you?
How will history remember you?How will history remember you?
How will history remember you?
Wellcome Library
Conservation for Digitisation
Conservation for DigitisationConservation for Digitisation
Conservation for Digitisation
Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome LibraryCopyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Copyright Clearance for Genetics Books, A pilot project at the Wellcome Library
Wellcome Library
Mandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome TrustMandating Open Access - Wellcome Trust
Mandating Open Access - Wellcome Trust
Wellcome Library

Recently uploaded (20)

Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data ProcessingBedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Zilliz
Supercharge Your Career with UiPath Certifications
Supercharge Your Career with UiPath CertificationsSupercharge Your Career with UiPath Certifications
Supercharge Your Career with UiPath Certifications
DianaGray10
Blockchain for Businesses Practical Use Cases & Benefits.pdf
Blockchain for Businesses Practical Use Cases & Benefits.pdfBlockchain for Businesses Practical Use Cases & Benefits.pdf
Blockchain for Businesses Practical Use Cases & Benefits.pdf
Yodaplus Technologies Private Limited
AI Trends and Fun Demos Sothebys Rehoboth Presentation
AI Trends and Fun Demos  Sothebys Rehoboth PresentationAI Trends and Fun Demos  Sothebys Rehoboth Presentation
AI Trends and Fun Demos Sothebys Rehoboth Presentation
Ethan Holland
The Constructor's Digital Transformation Playbook: Reducing Risk With Technology
The Constructor's Digital Transformation Playbook: Reducing Risk With TechnologyThe Constructor's Digital Transformation Playbook: Reducing Risk With Technology
The Constructor's Digital Transformation Playbook: Reducing Risk With Technology
Aggregage
TrustArc Webinar: State of State Privacy Laws
TrustArc Webinar: State of State Privacy LawsTrustArc Webinar: State of State Privacy Laws
TrustArc Webinar: State of State Privacy Laws
TrustArc
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
William Maclyn Murphy McRae - A Seasoned Professional Renowned
William Maclyn Murphy McRae - A Seasoned Professional RenownedWilliam Maclyn Murphy McRae - A Seasoned Professional Renowned
William Maclyn Murphy McRae - A Seasoned Professional Renowned
William Maclyn Murphy McRae
AMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes WebinarAMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes Webinar
ThousandEyes
Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)
Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)
Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)
Arthur Morgan
2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf
Ivan Tang
UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1
DianaGray10
5 Must-Use AI Tools to Supercharge Your Productivity
5 Must-Use AI Tools to Supercharge Your Productivity5 Must-Use AI Tools to Supercharge Your Productivity
5 Must-Use AI Tools to Supercharge Your Productivity
cryptouniversityoffi
Transcript: AI in publishing: Your questions answered - Tech Forum 2025
Transcript: AI in publishing: Your questions answered - Tech Forum 2025Transcript: AI in publishing: Your questions answered - Tech Forum 2025
Transcript: AI in publishing: Your questions answered - Tech Forum 2025
BookNet Canada
What is FinTech A Complete Guide to Financial Technology.pdf
What is FinTech A Complete Guide to Financial Technology.pdfWhat is FinTech A Complete Guide to Financial Technology.pdf
What is FinTech A Complete Guide to Financial Technology.pdf
Yodaplus Technologies Private Limited
Caching for Performance Masterclass: The In-Memory Datastore
Caching for Performance Masterclass: The In-Memory DatastoreCaching for Performance Masterclass: The In-Memory Datastore
Caching for Performance Masterclass: The In-Memory Datastore
ScyllaDB
SECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdf
SECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdfSECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdf
SECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdf
spub1985
Teaching Prompting and Prompt Sharing to End Users.pptx
Teaching Prompting and Prompt Sharing to End Users.pptxTeaching Prompting and Prompt Sharing to End Users.pptx
Teaching Prompting and Prompt Sharing to End Users.pptx
Michael Blumenthal (Microsoft MVP)
Deno ...................................
Deno ...................................Deno ...................................
Deno ...................................
Robert MacLean
What is Blockchain and How Can Blockchain Consulting Help Businesses.pdf
What is Blockchain and How Can Blockchain Consulting Help Businesses.pdfWhat is Blockchain and How Can Blockchain Consulting Help Businesses.pdf
What is Blockchain and How Can Blockchain Consulting Help Businesses.pdf
Yodaplus Technologies Private Limited
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data ProcessingBedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Zilliz
Supercharge Your Career with UiPath Certifications
Supercharge Your Career with UiPath CertificationsSupercharge Your Career with UiPath Certifications
Supercharge Your Career with UiPath Certifications
DianaGray10
AI Trends and Fun Demos Sothebys Rehoboth Presentation
AI Trends and Fun Demos  Sothebys Rehoboth PresentationAI Trends and Fun Demos  Sothebys Rehoboth Presentation
AI Trends and Fun Demos Sothebys Rehoboth Presentation
Ethan Holland
The Constructor's Digital Transformation Playbook: Reducing Risk With Technology
The Constructor's Digital Transformation Playbook: Reducing Risk With TechnologyThe Constructor's Digital Transformation Playbook: Reducing Risk With Technology
The Constructor's Digital Transformation Playbook: Reducing Risk With Technology
Aggregage
TrustArc Webinar: State of State Privacy Laws
TrustArc Webinar: State of State Privacy LawsTrustArc Webinar: State of State Privacy Laws
TrustArc Webinar: State of State Privacy Laws
TrustArc
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
William Maclyn Murphy McRae - A Seasoned Professional Renowned
William Maclyn Murphy McRae - A Seasoned Professional RenownedWilliam Maclyn Murphy McRae - A Seasoned Professional Renowned
William Maclyn Murphy McRae - A Seasoned Professional Renowned
William Maclyn Murphy McRae
AMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes WebinarAMER Introduction to ThousandEyes Webinar
AMER Introduction to ThousandEyes Webinar
ThousandEyes
Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)
Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)
Big Data Analytics Quick Research Guide by Arthur Morgan (PREVIEW)
Arthur Morgan
2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf
Ivan Tang
UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1
DianaGray10
5 Must-Use AI Tools to Supercharge Your Productivity
5 Must-Use AI Tools to Supercharge Your Productivity5 Must-Use AI Tools to Supercharge Your Productivity
5 Must-Use AI Tools to Supercharge Your Productivity
cryptouniversityoffi
Transcript: AI in publishing: Your questions answered - Tech Forum 2025
Transcript: AI in publishing: Your questions answered - Tech Forum 2025Transcript: AI in publishing: Your questions answered - Tech Forum 2025
Transcript: AI in publishing: Your questions answered - Tech Forum 2025
BookNet Canada
Caching for Performance Masterclass: The In-Memory Datastore
Caching for Performance Masterclass: The In-Memory DatastoreCaching for Performance Masterclass: The In-Memory Datastore
Caching for Performance Masterclass: The In-Memory Datastore
ScyllaDB
SECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdf
SECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdfSECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdf
SECURE BLOCKCHAIN FOR ADMISSION PROCESSING IN EDUCATIONAL INSTITUTIONS.pdf
spub1985
Deno ...................................
Deno ...................................Deno ...................................
Deno ...................................
Robert MacLean
What is Blockchain and How Can Blockchain Consulting Help Businesses.pdf
What is Blockchain and How Can Blockchain Consulting Help Businesses.pdfWhat is Blockchain and How Can Blockchain Consulting Help Businesses.pdf
What is Blockchain and How Can Blockchain Consulting Help Businesses.pdf
Yodaplus Technologies Private Limited

Jpeg2000 at Wellcome Library

  • 1. JPEG 2000 at the Wellcome Library Christy Henshaw Digitisation Programme Manager Wellcome Library JP2 Summit 12-13 May 2011 Library of Congress
  • 2. The Wellcome Trust A global charitable foundation Achieving extraordinary improvements in human and animal health Supporting the brightest minds in biomedical research and the medical humanities Exploring medicine in historical and cultural contexts
  • 4. The Wellcome Library Collections of books, manuscripts, archives, films and pictures on the history of medicine from the earliest times to the present day .
  • 5. The Wellcome Digital Library pilot, 2010-2013 Genetics and its Modern Foundations A new online resource for everyone interested in the history of human and animal health. Aims build sustainable/expandable mechanism foundation stone for WDL digitise key library holdings - relating to a major Trust challenge area digitise important third party content linked to theme use innovative content and tools to encourage discovery and use explore commercial partnerships enhance access to nontheme material
  • 6. JPEG 2000 conversion scope Wellcome Images image library, legacy images, 300,000 images in the archive Current projects pilot digitisation projects, 7m images 2010 2014 Long-term plans digitisation of large proportion of our collections (mainly special collections), 15m 25m images 2014 and beyond
  • 7. Type of content Printed books early printed books, modern books (monographs), pamphlets, reports Archives personal papers, institutional papers, unpublished works, mostly 20th century Manuscripts unpublished, handwritten manuscript books and related materials, mostly 17th, 18th and 19th century, can be fragile Artworks prints, paintings, posters, drawings, glass slides, etc.
  • 9. Books related to genetic research
  • 12. Decision to adopt JP2 JPEG 2000 was found to answer the following needs: Storage costs 20/30m TIFFs stored on online, backed-up storage = multiple petabytes. Needed something cost-effective. Quality needed a high-quality compressed format that would cover a wide range of content types. Robustness needed a well-established image format with a high chance of long-term support. Practical feasible to use in a Library digitisation workflow.
  • 13. Finding our way Working with JP2 opened up a whole new world reading specifications, finding conversion software, so many choices. Commissioned the report: JPEG 2000 as a Preservation and Access Format for the Wellcome Trust Libr Goal to find a single version of JPEG 2000 that would meet the needs of both long-term preservation and flexible delivery needs.
  • 14. The result Parameter Settings File format Part 1 (.jp2) Compression Lossy (6:1, 10:1) Tiling 1024 x 1024 Progression order RLCP Decomp levels 5 Quality layers 8 Code block size 6, 64x64 Regions of interest No TLM markers Yes Bypass N/A
  • 15. Embedding JP2 Chose LuraWave command line tool Some issues (bugs, or inconvenient implementations) arose, and all have been successfully addressed by LuraTech Created a firm consensus to use JP2 as the format for all stillimage digital imaging (with one or two exceptions) No plans to use JP2 for digital video but never say never Internal information sharing digital archivists, systems administrators, IT department, programme board members External communication and networking
  • 16. Current status, future plans Conversion of all new digital images is now carried out as standard Nearing the final stages of a project to convert 450k image backlog to JP2 (reducing current footprint from 20 Tb to 5.5 Tb) Large projects use lossy JP2, legacy picture library uses lossless Developed a strategy to determine compression levels Currently using the GUI, but will use the command line interface with our new workflow system, streamlining conversion and QA Medium term, will look at automating compression level selection
  • 17. Quality control for compression Visual inspection Color shifts, loss of detail, halo effects, pixelation, blurring, etc. Collection-based, representative sample Test range of compressions with intervals such as 2:1, 4:1, 6:1 Once artefacts are discovered, step back to previous compression ratio Worst-performing image rules, for any particular collection Efficient for homogenous collections less so for heterogenous collections with wide variety of content Archives particularly difficult black and white compresses very well colour drawings and photographs, not so well
  • 18. Establishing the JP2K-UK group Unknown who in the UK were using JPEG 2000, or considering it Unknown who was even interested in JPEG 2000 No one wants to work in a vacuum Discovered a high level of interest: British Library, The National Archives, Oxford, Kings College London, Cambridge and Southampton Universities, Digital Preservation Coalition, commercial companies/consultants Loose affiliation of the like-minded a user group
  • 19. Remit of the JP2K-UK group Initial meeting in December 2009 Everyone had a little knowledge no one knew enough Agreed the need to approach JP2 implementation from practitioners point of view Practitioner meaning those who manage digital imaging strategies and implementation Agreed need to share information and collaborate Discussed ideas for a conference, and creating some guidelines for the user community Wellcome encouraged to write a blog about specific experiences working with JP2
  • 20. Ouputs JPEG 2000 Seminar, held in London in November 2010 > 80 attendees > UK and European speakers and delegates > mostly non-technical audience Advocacy for practitioners needs > discussing and airing the needs and concerns of practitioners has influenced software developers, and even the JPEG Committee > JPEG 2000 at the Wellcome Library blog www.jpeg2000wellcomelibrary.blogspot.com
  • 21. Future plans for JP2K-UK Guidance for practitioners > Human readable > Focus on practicalities > Enable practitioners to make informed choices > Advice on implementation Community building > Case studies > Lessons learned > Networking (nationally and internationally)