Make delicious recipes!

Table of Contents

  1. Cassandra Introduction

Introduction to Cassandra



Cassandra

  1. Massively scalable
  2. Linearly scalable (If 2 nodes handle x traffic, then 4 nodes handle 2x!)
  3. NoSQL Database
  4. Failed nodes can be replaced with no downtime.
  5. Integrates with Hadoop and has MapReduce support. Also supports Apache Pig and Hive.
  6. Can store structured and unstructured data alike.



Architecture

  1. Masterless architecture - meaning all nodes are the same.
  2. Data automatically distributed among all nodes in the ring.
  3. Replication factor is configurable.
  4. Replication can be configured to work across data-centers also.
  5. Failed nodes' data is automatically replicated to maintain replication factor.
  6. Cassandra emphasizes denormalization.



Language Features (CQL 3.0)

  1. CQL has drivers available in Java, Python and Node.JS
  2. Does not support joins or subqueries, except for batch analysis through Hive.
  3. Lightweight transactions using the IF keyword in INSERT and UPDATE statements.
  4. Initial support for triggers.



Data Model

  1. Cassandra is essentially a hybrid between a key-value and a column-oriented (or tabular) database.
  2. Cassandra's column-family resembles a Table in RDBMS.
  3. Column families contain rows and columns.
  4. Each row has multiple columns, each of which has a name, value, and a timestamp.
  5. Different rows in the same column family do not have to share the same set of columns.
  6. Columns may be added to one or multiple rows at any time.
  7. Each key in Cassandra identifies one row with multiple columns (like a Primary Key).








Like us on Facebook to remain in touch
with the latest in technology and tutorials!


Got a thought to share or found a
bug in the code?
We'd love to hear from you:

Name:
Email: (Your email is not shared with anybody)
Comment:

Facebook comments:

Site Owner: Sachin Goyal