This document introduces Open High Availability Cluster (Open HA Cluster), an open source version of Solaris Cluster. It discusses why high availability is important, how Solaris Cluster provides high availability, available cluster agents, and how developers can use and contribute to Open HA Cluster.
2. Outline What is Open High Availability Cluster? Why is High Availability Important? How Solaris Cluster Provides High Availability Cluster Agents Cluster in Action Open-Source Roadmap How to Use Open HA Cluster How to Participate
4. Open High Availability Cluster The open-source code base for Solaris TM Cluster Based on Solaris Cluster 3.2 Hosted by the HA Clusters community group on opensolaris.org http://opensolaris.org/os/community/ha-clusters/ohac/ Code available under the CDDL
5. Why is Sun Open-Sourcing Solaris Cluster? Share Public should have access to the source-base We hope developers will contribute back their agents and other modifications Open Development Increase confidence in the cluster code-base Enhance the code-base with third-party contributions
6. Why is Sun Open-Sourcing Solaris Cluster? Shorten Feedback Loop Easier to incorporate feedback Find and fix bugs quicker Early adoption of new features Because Solaris Cluster is FUN!
8. Why is HA Important? Downtime is costly Failures are inevitable Hardware, Software, Human Error, Disaster With a single physical system, single points of failure are catastrophic Network card dies, CPU misbehaves, disk drive crashes ... HA Clusters automate the recovery process from inevitable failures, minimizing downtime and cost
10. Platform for High Availability Tolerates Single Points of Failure (and some multiple failures) Hardware redundancy with off-the-shelf hardware Robust software HA infrastructure Monitors the health of cluster infrastructure (hosts, shared storage, and network) Orchestrates recovery of applications and cluster infrastructure (shared storage and IP networking) Integrated tightly with the Solaris Operating System Robust membership including quorum to prevent split-brain Disk Fencing ensures Data Integrity in spite of failures
12. HA Failover Service Application failover Within nodes or container Containers failover Failover IP address Failover Service
13. Scalable Service Software Load Balancing Global Network Service Provides Global IP address with failure protection Scalable Service
14. Apache (Scalable) Global Network Service Provides Global IP address with failure protection MySQL (Failover) Example: Apache and MySQL
15. Heartbeats Membership Quorum Disk Fencing Resource Group Manager Resource (application) dependencies Inter RG dependencies RG affinities HA Failover Service Application failover Within nodes or container Containers failover Failover IP address Global File Service Failover File service Scalable Service Software Load Balancing Global Network Service Provides Global IP address with failure protection Solaris Cluster Architecture Monitoring
18. Cluster Agents (Data Services) Applications run on the cluster unmodified Cluster agents are the glue layer between applications and cluster infrastructure Custom agent for each application Interacts with the cluster core through APIs Provides start, stop, and other commands specific to the application to be called by the cluster framework Provides monitor daemon specific to the application Applications cannot break into the high availability business on Solaris Cluster without an agent!
19. Agents Development Several choices available Sun Cluster Agent Builder Generic Data Service (GDS) Data Service Development Library (DSDL) RGM Application Programming Interface (API)
20. Agents Available Include Business Logic Tier HA Sun Java System App Server PE/SE HA Sun Java System App Server EE HA Sun Java System Directory Server* HA Agfa IMPAX HA BEA Weblogic Server Scalable Broadvision One-To-One* HA IBM Websphere MQ HA IBM Websphere MQ Integrator IBM Lotus Notes + HA Oracle Application Server HA SAP liveCache HA SAP J2EE Engine HA SAP Enqueue Server Scalable SAP HA Siebel HA SWIFTAlliance Access HA SWIFTAlliance Gateway HA Sybase Adaptative Server* Web Tier / Presentation HA Sun Java System Web Server HA Sun Java System Messaging Server* HA Sun Java System Message Queue HA Sun Java System Calendar Server* HA Sun Java System Instant Messaging Server* Scalable Sun Java System Web Server HA Apache Web/Proxy Server HA Apache Tomcat Scalable Apache Web/Proxy Server Database Tier Oracle Parallel Server* HA Oracle9i and Oracle 9i RAC* HA Oracle 10g and Oracle 10gRAC* HA Oracle E-business Suite HA Oracle* HA Sybase* HA MySQL HA SAP/MaxDB Database HA PostgreSQL * Not yet open-source Management Infrastructure Tier HA Sun N1 Grid Engine HA Sun N1 Service Provisioning System HA DNS, HA NFS HA DHCP IBM Tivoli + Mainframe Rehosting (MTP)* HA Samba HA Solstice Backup* HA Solaris Container HA Symantec NetBackup* Online Agent list: http://www.opensolaris.org/os/community/ha-clusters/ohac/Documentation/Agents/open-agents/
22. Multiple Source Trees Solaris Cluster consists of four different source trees Agents Core Geographic Edition SCATE
23. What's Open-Source Now Most of the cluster agents Geographic Edition Cluster agents and geo edition G11N CTI for TET test framework Based on SCATE (Sun Cluster Automated Test Environment) Used for both ON and Cluster tests Test suites for agents and geo edition
24. Much More to Come 2008 release source for Sun Cluster core Test suites for core Asynchronously Docs Over three million lines of code to be open-sourced!
26. Solaris Cluster Express A complete version of Solaris Cluster software that runs on Solaris Express Solaris Cluster Express 02/08 (current release) Runs on Solaris Express Developer Edition 1/08 Runs on SPARC and 32 or 64-bit x86 hardware Support available only through community forums Where to get it http://opensolaris.org/os/community/ha-clusters/ohac/downloads/
27. Solaris Cluster 3.2 Most recent release of the Solaris Cluster product Runs on Solaris 9 and 10 Supported on SPARC and 64-bit x86 hardware Try it out for free Where to get it http://sun.com/software/solaris/cluster/get.html
28. Build and Run Agents and Geographic Edition Install Solaris Express Developer or Community Edition Compiler Sun Studio 11, Sun Studio 12, or gcc Build tools ON build tools (SUNWonbld) OHAC build tools (SUNWscbld) Download source tarballs and archives One-line nbuild command Install packages on Solaris Cluster Express or Solaris Cluster 3.2
29. Build, Install, and Run OHACTS Download CTI for TET source from testing community http://opensolaris.org/os/community/testing/testsuites/ctifortet/ Follow CTI for TET instructions to build it Download OHACTS source http://opensolaris.org/os/community/ha-clusters/ohac/downloads/ Build and install OHACTS package http://opensolaris.org/os/community/ha-clusters/ohac/Documentation/Tests/buildntest/ Run OHACTS http://opensolaris.org/os/community/ha-clusters/ohac/Documentation/OHACTS.pdf
31. Where to Start HA Clusters Community Group Web Page http://www.opensolaris.org/os/community/ha-clusters/ Open HA Cluster page http://opensolaris.org/os/community/ha-clusters/ohac/ ha-clusters-discuss@opensolaris.org mailing list http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss http://opensolaris.org/jive/forum.jspa?forumID=195 How to Contribute to Open HA Cluster http://opensolaris.org/os/community/ha-clusters/ohac/contribute/
32. What You Can Do Browse the code with OpenGrok http://src.opensolaris.org/source/xref/ohac/ Join the discussions on ha-clusters-discuss. File bugs http://www.opensolaris.org/bug/report.jspa Fix a bug http://opensolaris.org/os/bug_reports/oss_bite_size/ Contribute to an existing project http://www.opensolaris.org/os/projects/ Start a new project
33. Two new community projects created HA-Informix Cluster Agent for Informix Dynamic Server Created 1 st August 2007 Design Doc, Cheatsheet etc. available http://opensolaris.org/os/project/ha-informix/ HA-xVM Cluster Agent for Solaris xVM guest domains http://opensolaris.org/os/project/ha-xvm/
34. Possible Projects Develop Agents for Adabas, BMC Control-M, BMC Patrol Server, BMC Performance Manager, CA Unicenter, Citrix, Connect Direct, EMC Documentum, FileNet, IBM Tivoli Client, IBM Tivoli Server, IBM Tivoli Workload Scheduler, IBM Websphere Application Server, Informatica, Java CAPS, Java DB, JBOSS, Netbackup, Netbackup Client, Open LDAP, Sleepy cat DB, Sun Java System Identity Server, Sun Java System Portal Server, Sybase Replicator agent, TSM Backup client, UC4 More project suggestions for Open HA Cluster http://opensolaris.org/os/community/ha-clusters/ohac/contribute/agents-suggested-projects/ Or propose your own project!
35. Thank you! Robert Sohigian [email_address] Based on 際際滷s by [email_address] open artwork and icons by chandan: http://blogs.sun.com/chandan