GPFS: General Parallel File System
Why is it needed?
What is GPFS and what are its features?
Where is it being used?
Why is GPFS needed?
Growth Rate of Components
• CPU performance has increased 8 to 10 times.
• DRAM performance has increased 7 to 9 times.
• Network performance has increased 100 times.
• Bus performance has increased 20 times.
• But hard disk drive (HDD) performance has increased only 1.2 times.
Three Important Functions of Enterprise Storage
• Store data
• Protect data from being lost
• Feed data to the computer's processors (so they can keep doing work)
Inability of Existing Solutions
• DAS, NAS, SAN [alone]
• Many data centers have become victims of "filer sprawl"
• Data administration and management costs (such as migration, backups, archiving) skyrocket!
• I/O performance and application workflow
What is GPFS?
• The General Parallel File System (GPFS) is a high-performance clustered file system. It can be deployed in shared-disk or shared-nothing distributed parallel modes.
• Developer(s): IBM
• Operating system: AIX / Linux / Windows Server
• License: Proprietary
• System introduced: 1998 (AIX)
• Max. volume size: 8 YB
• Max. file size: 8 EB
• Max. number of files: 2^64 per file system
• File system permissions: POSIX
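The size limits above are easier to compare as powers of two. The following is a minimal sketch, assuming the slide's EB and YB are binary units (2^60 and 2^80 bytes), which the slide itself does not state; the constant and function names are illustrative.

```python
# Assumption: the slide's units are binary (1 EB = 2**60 bytes, 1 YB = 2**80 bytes).
EB = 2**60
YB = 2**80

max_file_size = 8 * EB    # "Max. file size: 8 EB"
max_volume_size = 8 * YB  # "Max. volume size: 8 YB"


def log2_exact(n: int) -> int:
    """Return k such that n == 2**k, or raise if n is not a power of two."""
    if n <= 0 or n & (n - 1):
        raise ValueError(f"{n} is not a power of two")
    return n.bit_length() - 1


if __name__ == "__main__":
    print(f"max file size   = 2^{log2_exact(max_file_size)} bytes")
    print(f"max volume size = 2^{log2_exact(max_volume_size)} bytes")
```

Under that assumption, 8 EB works out to 2^63 bytes and 8 YB to 2^83 bytes.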
GPFS Current Usage
• It is used by many of the world's largest commercial companies, as well as by some of the supercomputers on the TOP500 list.
• For example, GPFS was the file system of the ASC Purple supercomputer, which was composed of more than 12,000 processors and had 2 petabytes of total disk storage spanning more than 11,000 disks.
• IBM's GPFS is used extensively across multiple industries, such as government, oil and gas, life sciences, media/entertainment, and financial services.
GPFS Features
• Standard file system interface with POSIX semantics
  - Metadata on shared storage
  - Distributed locking for read/write semantics
• Highly scalable
  - High capacity (up to 2^99 bytes file system size, up to 2^63 files per file system)
  - High throughput (TB/s)
  - Wide striping
  - Large block size (up to 16 MB)
  - Multiple nodes write in parallel
• Advanced data management
  - ILM (storage pools), Snapshots
  - Backup HSM (DMAPI)
  - Remote replication, WAN caching
• High availability
  - Fault tolerance (node, disk failures)
  - On-line system management (add/remove nodes, disks, ...)
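Wide striping, listed above, spreads the consecutive blocks of a file across many disks so that several disks can serve one file in parallel. The following is a minimal sketch of simple round-robin placement, not GPFS's actual allocator; the function name `stripe_layout`, the 4-disk cluster, and the 100 MB file are illustrative assumptions.

```python
from collections import defaultdict

BLOCK_SIZE = 16 * 1024 * 1024  # 16 MB, the upper bound quoted on the slide


def stripe_layout(file_size: int, num_disks: int,
                  block_size: int = BLOCK_SIZE) -> dict[int, list[int]]:
    """Map each disk index to the file block numbers it would hold (round-robin)."""
    if num_disks <= 0 or block_size <= 0:
        raise ValueError("num_disks and block_size must be positive")
    num_blocks = -(-file_size // block_size)  # ceiling division
    layout = defaultdict(list)
    for block in range(num_blocks):
        # Consecutive blocks land on consecutive disks, wrapping around.
        layout[block % num_disks].append(block)
    return dict(layout)


if __name__ == "__main__":
    # A hypothetical 100 MB file striped over 4 disks.
    layout = stripe_layout(100 * 1024 * 1024, num_disks=4)
    for disk, blocks in sorted(layout.items()):
        print(f"disk {disk}: blocks {blocks}")
```

Because neighbouring blocks sit on different disks, a large sequential read or write can keep all disks busy at once, which is the idea behind the throughput figures quoted on the slide.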