Course Outline

  • Section 1: Introduction to Big Data / NoSQL
    • NoSQL overview
    • CAP theorem
    • When is NoSQL appropriate
    • Columnar storage
    • NoSQL ecosystem
  • Section 2: Cassandra Basics
    • Design and architecture
    • Cassandra nodes, clusters, datacenters
    • Keyspaces, tables, rows and columns
    • Partitioning, replication, tokens
    • Quorum and consistency levels
    • Labs: interacting with Cassandra using cqlsh
  • Section 3: Data Modeling – part 1
    • Introduction to CQL
    • CQL data types
    • Creating keyspaces and tables
    • Choosing columns and types
    • Choosing primary keys
    • Data layout for rows and columns
    • Time to live (TTL)
    • Querying with CQL
    • CQL updates
    • Collections (list / map / set)
    • Labs: various data modeling exercises using CQL; experimenting with queries and supported data types
  • Section 4: Data Modeling – part 2
    • Creating and using secondary indexes
    • Composite keys (partition keys and clustering keys)
    • Time series data
    • Best practices for time series data
    • Counters
    • Lightweight transactions (LWT)
    • Labs: creating and using indexes; modeling time series data
  • Section 5: Cassandra Internals
    • Understanding Cassandra's design under the hood
    • SSTables, memtables, commit log
  • Section 6: Administration
    • Hardware selection
    • Cassandra distributions
    • Communication between Cassandra nodes
    • Writing data to and reading data from the storage engine
    • Data directories
    • Anti-entropy operations
    • Cassandra compaction
    • Choosing and implementing compaction strategies
    • Cassandra best practices (compaction, garbage collection)
    • Creating a test Cassandra instance with low memory footprint
    • Troubleshooting tools and tips
    • Lab: students install Cassandra and run benchmarks

Requirements

  • Comfortable in a Linux environment (navigating the command line, editing files with vi / nano)
  • For on-site courses, a laptop or desktop with 8 GB of RAM
  • For remote courses, a working Cassandra lab will be provided; only a web browser is needed
 14 Hours
