Mostly Used Cassandra Terminologies | Cassandra Glossary
Keeping you updated with latest technology trends, Join DataFlair on Telegram
1. Objective – Cassandra Terminologies
In this Cassandra tutorial, we will go through all the important Cassandra Terminologies. These Terminologies in Cassandra are Cassandra Cluster, Cassandra Keyspace, Cassandra Data model, Cassandra Column Family, SStable and many more.
So, let’s start Cassandra Terminologies tutorial.
2. Cassandra Terminologies
Below are some mostly used Cassandra Terminologies. Let’s discuss them one by one:
The mechanism that ensures that every node contains update data.
It is one of a base for the creation of Cassandra. It was created at Google in 2006 as a high-performance database system.
iii. Bloom Filter
It is an algorithm that determines if an element is a member of a particular set.
Open-source, distributed database management system that handles a big set of data.
Cassandra Cluster is a collection of many nodes. Many clusters together form a database.
It is the basic or the innermost unit in the database. It contains the main data or information.
vii. Column Families
It is a storage unit in keyspace. We can consider it as a table. It contains rows and columns, that in turn, contains the data.
viii. Commit Log
It is basically a median to enter the data in Memtable an SSTable. Before going to the Memtable and SSTable, the data is stored in the commit log. This ensures that at the time of failure, the data is not lost.
It is a process that basically compresses the storage. It basically merges large files and frees up space.
x. Consistency Level
This allows the user to set the number of replicas in a cluster. In these replicas, the data is copied and it increases the redundancies.
Let’s revise Cassandra Features
It is the other base of Cassandra. This along with principles of Bigtable led to the creation of Cassandra.
xii. Failure Detection
It is the process of determining the nodes that have experienced failure in a cluster.
It is the capacity of the database system to handle faults.
It is a storage unit in a database. Basically, it is the outermost container in a node. It contains many column families.
It is a storage table representation. The data flow here after going through the committable. After this, the data flush to disk as an SSTable.
It basically represents a system. They contain many column families that contain data.
Learn Cassandra Applications | Why Cassandra Is So Popular?
Cassandra NoSQL basically determines the collection of non-sequential data. In other words, it is a non-relational data.
xviii. Primary key
It is a row that basically determines the main row.
It is a process in which the data is copied from one node to another. This process ensures redundancies.
xx. Replication Factor
It determines the number of replicas that are created in a cluster of a given data.
xxi. Replication Strategy
Basically, the strategy that used to carry out replication in the cluster.
It is a storage unit. It is a part of column family and contains many columns.
xxiii. Seed Node
It is a kind of a node which is basically used by newly added nodes to get up and running.
A Sort string table. It allows high performance and can be compacted.
It is a column which is basically a map of other columns. In one kind it contains all the columns.
Read Cassandra Data Model | How Cassandra Stores Data
An API that uses when user deals with the table as a storage unit.
Timestamp are column values that basically is given by the user. It determines the time of a column details.