Apache Cassandra is a second-generation distributed database originally open-sourced by Facebook. Its write-optimized shared-nothing architecture results in excellent performance and scalability.
Cassandra moves away from the master-slave model and instead uses a peer-to-peer model: there is no single master, and every node can potentially act as one. This makes reads and writes highly scalable and allows the cluster to keep functioning during network partitions.
The large volume and variety of data that today's businesses process call for a highly available, low-latency database. Cassandra meets this need by permitting high-speed reads and writes across a replicated, distributed system.
- Benchmark read and write operations
- Recognize different types of failure
- Fix a failed or partially failed cluster
- Identify potential performance bottlenecks
- Perform live schema updates
- Perform move operations
- Plan and perform cluster-wide operations
- Monitor compaction, repair, and hinted handoff
- Architect and engineer Cassandra databases for competitive advantage
- Model data in Cassandra based on query patterns
- Access Cassandra databases using CQL and Java
- Create a balance between read/write speed and data consistency
- Integrate Cassandra with Hadoop, Pig and Hive
- Implement commonly used Cassandra design patterns
The Apache Cassandra - Administrator workshop is designed for administrators with basic knowledge of databases.
Database administrators, data analytics professionals, data architects, managers