Top 50 Awesome List

Anant/awesome-cassandra

Databases  2 months ago  129
A curated list of the best resources in the Cassandra community.
View byDAY/WEEK/README
View on Github

Aug 28th

General

Cassandra Compliant Databases on JVM

  • DataStax Enterprise - Most widely used commercial distribution of Cassandra, integrated with Apache Spark (for SparkSQL, analytics), Apache Solr (for secondary index), Apache TinkerPop based Graph stored in Cassandra, and OpsCenter.
  • General

    Cassandra as a Service / Managed Cassandra Based on Open Source Cassandra

  • DataStax Astra - DataStax Astra Cassandra as a Service running on the Kubernetes operator Cassandra available on AWS and GCP.
  • Apr 30th

    Resources

    Blogs

  • The Netflix Tech Blog - Learn about Netflix’s world class engineering efforts, company culture, product developments and more.
  • Spotify R&D / Engineering Blog : Cassandra - Cassandra related posts on Spotify's official technology blog.
  • Apr 29th

    General

    Cassandra Deployment

  • tlp-cluster, a tool for launching Cassandra clusters in AWSstars16 - Provisioning tool for Cassandra designed for developers looking to both benchmark and test the correctness of Cassandra. It assists with builds and starting instances on AWS.
  • Setting Up Cassandra Cluster Through Ansible - Guide detailing how to set up a Cassandra cluster with automation using Ansible.
  • Running Cassandra on DC/OS (Mesos) - Blog that shows how to setup DC/OS in the Amazon cloud, how to install Cassandra on a DC/OS cluster, and finally new ways to interact with and Cassandra after it is installed.
  • General

    Cassandra Deployment on Docker / Containerized Cassandra

  • Docker-Cassandrastars212 - Set of scripts and config files to run a Cassandra cluster from Docker.
  • Cassandra Dockerstars7 - Instaclustr public docker image for Cassandra. It contains docker images for Cassandra 3.0 and 3.11.1.
  • Cassandra / Elassandra Dockerstars0 - Cassandra and Elassandra docker images.Cass Operator is maintained by a team at DataStax and it is part of what powers DataStax Astra.
  • Databases

    Custom Time Series

  • Hawkular.org - Time series / distributed tracing database powered by Cassandra by Redhat.
  • Newts - Time-series data store based on Cassandra.
  • Resources

    Blogs

  • Datastax - DataStax, Inc. is a data management company that provides commercial support, software, and cloud database-as-a-service based on Cassandra.
  • Codecentric: Cassandra - Codecentric is an IT consulting company, these are their blog posts surrounding the topic of Cassandra.
  • Pythian: Cassandra - Pythian provides data and cloud-related services. The company provides services for Oracle, SQL Server, MySQL, Hadoop, Cassandra and other databases and their supporting infrastructure.
  • Instaclustr - Managed and supported open source solutions for Cassandra, Kafka, Elasticsearch & Redis.
  • OpenCredo:Cassandra - OpenCredo is a consulting company that helps clients make informed decisions around cloud native and open source technologies, as well as public cloud services.
  • DOAN DuyHai's Blog: Cassandra - Duyhai Doan is a freelance big data and cloud architect who values sharing knowledge and contributing to the technology community.
  • Amy Tobert - Amy Tobert is a full-stack engineer & leader with passion for sustainable systems and people-centered leadership. Her blog details different Cassandra deployments amont other topics.
  • Christopher Batey: Cassandra - Christopher Batey is a software engineer of over 15 years and is a primary contributor to Akka and occasional contributor to Cassandra.
  • Distributed Bytes: Cassandra - Tim Ojo is the creator of Distributed Bytes and software engineer at Capital one. These are a collection of his posts surrounding the topic of Cassandra.
  • Ryan Svilha - Ryan Svilha is a principle engineer at DataStax. His blog posts covers topics surround Cassandra and associated tools.
  • Anant - Anant builds and manages business platforms of which they connect customer experiences and information systems with real-time data platforms.
  • General

    Cassandra Use Cases

  • An Odyssey of Cassandra - Older article that has republished but talks about transitioning from SQL to NoSQL with Cassandra.
  • General

    Cassandra Data Modeling

  • CQL: This is not the SQL you are Looking For - Presentation that explores and explains the differences between the CQL and SQL languages.
  • Spring Data Cassandra Examples - Maven project that contains examples showcasing the features and functionality of the Spring Data Cassandra project.
  • General

    Cassandra Architecture

  • Guide to Cassandra Thread Pools - Guide that provides a description of the different thread pools and how to monitor them. Includes what to alert on, common issues and solutions. Old but very useful reference.
  • The Gossip Protocol - Inside Cassandra. - Good visual explanation of how Cassandra keeps consistent.
  • Introduction To The Cassandra 3.x Storage Engine - The 3.x storage engine makes it easier for Cassandra to get bytes off disk.
  • Dropping columns in Cassandra 3.0 - Blog post describing the steps Cassandra takes when a column is dropped.
  • About Deletes and Tombstones in Cassandra - Deleting distributed and replicated data from a system such as Cassandra is far trickier than in a relational database.
  • Undetectable tombstones in Cassandra - Indepth analysis of cell and range tombstones.
  • Understanding the Nuance of Compaction in Cassandra - Overview of how Cassandra manages data on disk.
  • Improving Cassandra's Front Door and Backpressure - Explore how an incoming request was processed by Cassandra before, see what they changed, and look at new relevant configuration knobs available.
  • Cassandra Architecture - High level overview of Cassandra from Instaclustr.
  • The 10 Things I hate about Cassandra - Do you really want to use Cassandra? Learn why not to use it.
  • General

    Cassandra Monitoring

  • Resources for Monitoring Datastax, Cassandra, Spark, & Solr Performance - Blog post detailing different types of monitoring tools and their purpose.
  • Monitoring Cassandra With Grafana And Influx DB - Blog post explaining how to set up Cassandra monitoring with influxDB and Grafana.
  • Cassandra Monitoring - Introduction (1/2) - Blog post detailing how Cassandra metrics can be gathered.
  • Monitoring Cassandra using Intel Snap and Grafana - Blog post describing how to monitor Cassandra using the Intel Snap open source telemetry framework.
  • Cassandra Monitoring Best Practice Guide - Blog post that aims to touch all the important aspects of Cassandra monitoring.
  • General

    Cassandra Performance Tuning

  • Modeling real life workloads with cassandra-stress is hard - Blog post detailing caveats with cassandra-stress when modeling real workloads.
  • Cassandra Node Diagnostics Toolsstars51 - Monitoring and audit power kit for Cassandra.
  • Performing User Defined Compactions in Cassandra - Documenting a process by which we tell Cassandra to create a compaction task for one or more tables explicitly.
  • General

    Cassandra Deployment on Kubernetes / Kubernetized Cassandra

  • CassKop - Cassandra operator for Kubernetesstars169 - Kubernetes operator automates the Cassandra operations such as deploying a new rack aware cluster, adding/removing nodes, configuring the C and JVM parameters, upgrading JVM and C versions. Written in Go.
  • K8ssandra.io - Kubernetes + Cassandra - K8ssandra provides a production-ready platform for running Cassandra on Kubernetes. This includes automation for operational tasks such as repairs, backups, and monitoring.
  • Datastax - Cassandra Kubernetes Operatorstars235 - Datastax's Cassandra Kubernetes Operator which supports Datastax as well as open source Cassandra containers on Kubernetes.
  • Rook.io - Cassandra on Kubernetes - Rook is an open source cloud-native storage orchestrator, providing the platform, framework, and support for a diverse set of storage solutions to natively integrate with cloud-native environments. They have a special operator for Cassandra amongst other providers.
  • Kudo Cassandar Operatorstars10 - The KUDO Cassandra Operator makes it easy to deploy and manage Cassandra on Kubernetes.
  • General

    Integrating with Cassandra

  • Building a Streaming Data Hub with Elasticsearch, Kafka and Cassandra - Blog post detailing how a streaming analytics system on top of open source, big data components can be done.
  • General

    Spark

  • DataStax Spark Cassandra Connectorstars1.8k - Library that lets you expose Cassandra tables as Spark RDDs, write Spark RDDs to Cassandra tables, and execute arbitrary CQL queries in your Spark applications.
  • Packages

    Libraries

  • DataStax Java Driverstars1.2k - Java client driver for Cassandra.
  • DataStax C++ Driverstars346 - Modern, feature-rich, and highly tunable C/C++ client library for Cassandra (1.2+) and DataStax Enterprise (3.1+) using exclusively Cassandra's native protocol and Cassandra Query Language v3.
  • DataStax Python Driverstars1.3k - Modern, feature-rich and highly-tunable Python client library for Cassandra (2.1+) using exclusively Cassandra's binary protocol and Cassandra Query Language v3.
  • DataStax Ruby Driverstars226 - Ruby client driver for Cassandra. This driver works exclusively with the Cassandra Query Language version 3 (CQL3) and Cassandra's native protocol.
  • DataStax Node.js Driverstars1.1k - Modern, feature-rich and highly tunable Node.js client library for Cassandra (1.2+) and DataStax Enterprise (3.1+) using exclusively Cassandra's binary protocol and Cassandra Query Language v3.
  • DataStax C# Driverstars509 - Modern, feature-rich and highly tunable C# client library for Cassandra (1.2+) and DataStax Enterprise (3.1+) using exclusively Cassandra's binary protocol and Cassandra Query Language v3.
  • DataStax PHP Driverstars417 - DataStax PHP Driver for Cassandra.
  • Achilles - Achilles is an open source Persistence Manager for Cassandra,with the features like Advanced bean mapping (compound primary key, composite partition key, timeUUID, ect),Native collections and map support,and so.
  • phpcassastars250 - PHP client library for Cassandra.
  • Caffinitas - Caffinitas is an advanced object mapper for Cassandra which has been especially designed to work with Datastax Java Driver 2.1+ against Cassandra 2.1, 2.0 or 1.2.
  • Spring Data for Cassandra - Spring Data for Cassandra offers a familiar interface to those who have used other Spring Data modules in the past.
  • Packages

    Tools

  • Ansible-Galaxy: Cassandra GitHubstars10 - Collection called cassandra that aims at providing all Ansible modules allowed to interact with Cassandra.
  • Hackolade - Visual data modeling tool for NoSQL databases and stuctures like Cassandra, ElasticSearch, Graph DBs, JSON, APIs.
  • Datastax - Management API for Cassandrastars44 - The Management API is a sidecar service layer that attempts to build a well supported set of operational actions on Cassandra® nodes that can be administered centrally.
  • RazorSQL - Multi DB Manager Tool - Multi-db tool for Linux, Mac, and Windows that works with Cassandra.
  • KDM - The Kashlev Data Modeler - Automated big data modeling tool for Cassandra.
  • Cassandra Reaper - Automated repairs for Cassandra. Supports all versions.
  • cstar perfstars69 - Cassandra performance testing platform.
  • Spark Cassandra Stressstars25 - Tool for testing the DataStax Spark Connector against Cassandra or DSE.
  • Cassalogstars14 - Cassalog is a schema change management library and tool for Cassandra that can be used with applications running on the JVM.
  • Cassandra-web - Web interface for Cassandra.
  • tlp-cluster - Provisioning tool for Cassandra designed for developers looking to benchmark and test Cassandra. It assists with builds and starting instances on AWS.
  • Helenosstars164 - Free web based environment that simplifies a data exploring & schema management with Cassandra database.
  • Cassandra-Migrationstars50 - Cassandra / DataStax Enterprise database migration (schema evolution) library.
  • Instaclustr Kerberos pluginstars5 - GSSAPI authentication provider for Cassandra.
  • Packages

    Logging /Metrics

  • ctopstars2 - Very simple console tool for monitoring column families read/write activities at remote cassandra host.
  • Metrics Collector for Cassandrastars74 - Metric collection and Dashboards for Cassandra (2.2, 3.0, 3.11, 4.0) clusters. Comes with dashboards for Graphana.
  • Cassandra Log Toolsstars8 - Simple scripts for working with Cassandra logs.
  • Resources

    Videos

  • Monitoring Cassandra: Don't Miss a Thing (Alain Rodriguez, The Last Pickle) | C* Summit 2016 - Talk given by Alain Rodriguez, Consultant at The Last Pickle, discussing what to monitor in Cassandra, how, and why.
  • Cassandra.Lunchstars5 - Collection of all past Cassandra.Lunch webinars including videos, slides, and Blog posts surrounding all topics Cassandra.
  • General

    Cassandra History

  • ZDNet: Cassandra Turns 10 - Highlights of the growth of Cassandra over it's first 10 years.
  • General

    Cassandra Compliant Databases on JVM

  • DDAC/Luna - Datastax Distribution of Cassandra, a production ready distribution with a bulk loader supported by Datastax. DDAC is Deprecated now, but Datastax is still supporting Cassandra with it's new Luna Service.
  • General

    Cassandra Compliant Databases on C++

  • ScyllaDBstars7.2k - NoSQL data store using the seastar framework, compatible with Cassandra.
  • General

    Cassandra as a Service / Managed Cassandra Based on Open Source Cassandra

  • Instaclustr Managed Cassandra as a Service - Instaclustr provides a fully managed and SOC 2 certified hosted & managed service for Cassandra® on AWS, Azure, GCP and IBM Cloud.
  • Aiven for Cassandra - Aiven for Cassandra is a managed and hosted distributed NoSQL database providing scalability, high availability, and excellent fault tolerance. Cassandra as a Service is available on Google Cloud Platform, Amazon Web Services, Microsoft Azure, DigitalOcean, and UpCloud.
  • Microsoft Azure Managed Instance for Cassandra - Azure Managed Instance for Cassandra provides automated deployment and scaling operations for managed open-source Cassandra datacenters. It accelerates hybrid scenarios and reduces ongoing maintenance.
  • General

    Cassandra as a Service / Managed Cassandra Based on Proprietary Technology

  • Microsoft Azure Cosmos DB: Cassandra API - Azure Cosmos DB provides the Cassandra API (preview) for applications that are written for Cassandra that need premium capabilities.
  • Amazon Keyspaces for Cassandra - Amazon Web Services (AWS) Amazon Keyspaces for Cassandra provides a CQL compliant access to a "Serverless" auto-scaling datastore.
  • General

    Using Cassandra

  • The LIMIT Clause in Cassandra might not work as you think - Blog post for the considerations on the efficiency of the LIMIT clause.
  • Top 5 reasons to use the Cassandra Database - Few good reasons why you'd want to consider Cassandra.
  • Cassandra Use Cases: When to use and when not to use Cassandra - Practical guide for when to use and when not to use Cassandra.
  • Cassandra Database (Guide) - Great guide to learn about Cassandra, from Instaclustr.
  • General

    Cassandra Maintenance

  • Cassystars39 - Simple and integrated backup tool for Cassandra.
  • Medusastars163 - Cassandra backup system.
  • General

    Cassandra Security

  • Securing Cassandra with Application Level Encryption - Discusses how to do application level data encryption to properly manage secure information in Cassandra.
  • LDAP Authenticator for Cassandrastars20 - Pluggable authentication implementation for Cassandra, providing a way to authenticate and create users based on a configured LDAP server.
  • Databases

    Miscellaneous

  • Apache/Usergridstars997 - Open source Backend as a Service (BaaS) on Cassandra, Elasticsearch with client SDKs for iOS/Android/.NET/Java.
  • Packages

    Open Source Applications

  • Cassandra Cluster Adminstars205 - Cassandra Cluster Admin is a GUI tool to help people administrate their Cassandra cluster.
  • CCM: Cassandra Cluster Manager)stars1.2k - Script/library to create, launch and remove an Cassandra cluster on localhost.
  • CStarstars244 - Cassandra cluster orchestration tool for the command line.
  • Resources

    Documentation

  • Cassandra Documentation - Definitive documentation for all published versions.
  • Resources

    Communities

  • Cassandra Users Mailing List
  • Cassandra Developers Mailing List
  • Cassandra Commits Mailing List
  • Resources

    Slides

  • HAPI Cassandrastars5 - Simple REST API with hapi Node.js framework on top of a Cassandra database.
  • Apr 22nd

    General

    Cassandra Performance Tuning

  • Gatling DSE Stressstars5 - Tool for stress testing DSE.
  • General

    Search / Secondary Indexes

  • Elassandra - Elassandra = Elasticsearch as a Cassandra secondary index.
  • Packages

    Open Source Applications

  • Twissandrastars790 - Twissandra is an example project, created to learn and demonstrate how to use Cassandra. Running the project will present a website that has similar functionality to Twitter.
  • Resources

    Slides

  • Hardening Cassandra for Compliance or Paranoia - Includes details on configuring SSL, setting up a certificate authority and creating certificates and trust chains for the JVM.
  • Apr 19th

    General

    Using Cassandra

  • How to install Cassandra 2 on CentOS 7 / RHEL 7 - Guide on how to install Cassandra on the popular linux distributions RedHat and CentOS.
  • General

    Cassandra Architecture

  • Common Problems with Cassandra Tombstones - Large number of tombstones causes Latency and heap pressure.
  • General

    Cassandra Performance Tuning

  • Gatling DSE Plugin for Gatling Load injectorstars8 - Plugin for the Gatling load injector. It adds CQL support in Gatling for Datastax Enterprise. It allows for benchmarking Datastax Enterprise features, including DSE Graph Fluent API.
  • General

    Integrating with Cassandra

  • Docker container for Kafka - Spark streaming - Cassandrastars92 - Dockerfile that sets up a complete streaming environment for experimenting with Kafka, Spark streaming (PySpark), and Cassandra.
  • Databases

    Monitoring / Metrics

  • cortexproject/cortexstars4.4k - Horizontally scalable, highly available, multi-tenant, long term Prometheus storage.
  • Packages

    Tools

  • CassandraCASstars2 - Compare-and-swap tool for Cassandra created by Datomic.
  • Pelotonstars568 - Unified resource scheduler created by Uber. This tool can handle many nodes and clusters through resource management and scalability.
  • Ansible-dsestars12 - Set of Ansible playbooks that will build a Datastax Enterprise cluster.
  • DBeaver - Free Universal Database Tool - Third party tool for dealing with all sorts of databases including Cassandra.
  • Web: Cassandra Calculator - Simple calculator to see how size / replication factor affect the system's consistency.
  • Netflix: Staashstars202 - Language-agnostic as well as storage-agnostic web interface for storing data into persistent storage systems, the metadata layer abstracts a lot of storage details and the pattern automation APIs take care of automating common data access patterns.
  • SSTable Toolsstars151 - Toolkit for parsing, creating and doing other fun stuff with Cassandra 3.x SSTables.
  • CQL Data Modeler - Very useful tool to test out a CQL schema and visualize what the partition would like in relationship to the columns and rows.
  • Cassandra Snapshot Backupstars6 - Quick and easy way to snapshot files in a Cassandra database and back them up using Ansible.
  • Slothsandrastars0 - Integration for Cassandra with the Slack app, which stores old messages that Slack no longer does itself.
  • sandraRESTstars22 - Cassandra manager with a web UI for RESTful APIs.
  • Cassandra Leadershipstars7 - Library to help elect leaders using cassandra. Uses paxos to build a leadership election module.
  • Terraform Cassandrastars6 - Terraform module that creates a Cassandra cluster.
  • Datadog - Third party tool that allows monitoring and metrics for Cassandra nodes and clusters.
  • Packages

    Open Source Applications

  • ChronoServerstars2 - Test server for sampling how long it takes mobile & web clients to make various types of requests to a server doing common request patterns.
  • CMBstars280 - Highly available, horizontally scalable queuing and notification service compatible with AWS SQS and SNS.
  • CassieQstars47 - Distributed queue built off of Cassandra.
  • Schedulerstars196 - Scala library for scheduling arbitrary code to run at an arbitrary time.
  • Resources

    Slides

  • GumGum: Multi-Region Cassandra in AWS - Presentation that details how Gumgum scaled out from one local Cassandra datacenter to a multi-datacenter Cassandra cluster and all the problems they encountered and choices they made while implementing it.
  • Cassandra DataTables Using Restful API - How to create a performant API using Python / Flash.
  • Securing Cassandra - Ben Bromhead CTO of Instaclustr, will explore the various ways in which you can setup and secure Cassandra appropriately for your threat environment.
  • Resources

    Videos

  • Best Practices for Running Cassandra on AWS - Joint webinar between Amazon Web Services (AWS) and Stackdriver, an AWS Technology partner, to learn best practices that apply to storing, analyzing and managing queries that equate to over 1+ billion measurements a day.
  • Apr 16th

    General

    Cassandra Architecture

  • Deletes and Tombstones - Explains how deletes create tombstones in Cassandra and what they are.
  • Databases

    Custom Time Series

  • OpenTSDB/opentsdbstars4.5k - GitHub resources for OpenTSDB. A Distributed, Scalable Monitoring System built on a Time Series Database.
  • Databases

    Graph

  • Thinkaurelius/Titanstars5.2k - Distributed Graph Database, predecessor to DSE Graph, JanusGraph, and now HugeGraph.
  • Hugegraph/Hugegraphstars1.8k - HugeGraph Database core component, including graph engine, API, and built-in backends.
  • Databases

    Miscellaneous

  • Scalar-labs/Scalardlstars43 - Tamper-evident and scalable distributed ledger platform.
  • Wikimedia/Restbasestars92 - Distributed storage with REST API & dispatcher for backend services.
  • Wikimedia/restbase-mod-table-specstars2 - Shared spec and tests for RESTBase table storage.
  • Packages

    Tools

  • Ansible-Galaxy: Cassandra - Documentation for Ansible-Galaxy: Cassandra.
  • DbSchema - Cassandra Designer - DbSchema: Cassandra Diagram Designer & GUI Admin Tool which can do Cassandra amongst other databases.
  • Cassandra-Exporterstars37 - Simple Tool to Export / Import Cassandra Tables into JSON.
  • Apr 15th

    General

    Cassandra Architecture

  • Cassandra Architecture and Operations - High level overview in one page of how Cassandra works.
  • General

    Cassandra Monitoring

  • How to Monitor Cassandra - Guide to help you monitor Cassandra performance and work metrics regardles of which monitoring tool you choose to use.
  • Cassandra metrics and their use in Grafana - Case study of using Cassandra metrics in Grafana.
  • Monitoring Cassandra with Prometheus - Quick setup guide to using Cassandra with Prometheus.
  • Cassandra Monitoring - Graphite/InfluxDB & Grafana on Docker (2/2) - Continuation of the previous entry exploring the topic of Cassandra metric reporters mentioned in Part I. The goal is to configure a reporter that sends metrics to an external time series database.
  • General

    Cassandra Performance Tuning

  • Ryan Svihla's Cassandra 2.0 checklist - Checklist for determining the efficiency of your Cassandra database.
  • Amy's Cassandra 2.1 tuning guide - Guide to tracking down performance issues in production level Cassandra clusters.
  • DSE 5.1: Tuning Java Resource - Documentation for tuning JVM.
  • Apr 14th

    Databases

    Graph

  • Introduction to TitanDB - Introductory slides about TitanDB.
  • JanusGraph/janusgraphstars4.2k - JanusGraph: an open-source, distributed graph database, successor to TitanDB.
  • Large Scale Graph Analytics with JanusGraph - Slides detailing deployment options and technical aspects of JanusGraph.
  • Architecture Overview · GitBook - Documentation for HugeGraph.
  • Databases

    Miscellaneous

  • Stargatestars527 - Stargate is an open-source data gateway that provides REST, GraphQL and schemaless JSON interfaces to Cassandra.
  • Meet Stargate, DataStax's GraphQL for databases. First stop - Cassandra - Introduction and high-level overview of Stargate.
  • Building Your Own BaaS With Apache Usergrid & Docker: Lessons Learned At Scale - Introductory presentation to Apache UserGrid.
  • General

    Cassandra Use Cases

  • Datastax Academy: What is Cassandra? - Introduction to what Cassandra is, where it came from, and some of it's benefits.
  • General

    Using Cassandra

  • Installing the Cassandra / Spark OSS Stack - Installation process and user guide for the Cassandra / Spark OSS Stack.
  • The Cassandra Query Language - Documentation for CQL.
  • Building a Performant API using Go and Cassandra - Tutorial documenting how to build a RESTful API using Go and Cassandra.
  • Introduction to Spark & Cassandra - Blog post on setting up a really simple Spark job that does a data migration for Cassandra.
  • From Cassandra to S3, with Spark - Blog post showing how to connect Spark to Cassandra, analyze event data from Cassandra, and store the results of the analysis into S3, making it available for reporting or further analysis.
  • General

    Cassandra from Relational

  • Cassandra Query Language: CQL vs SQL - Blog post documenting similarities and differences between CQL and SQL.
  • General

    Cassandra Data Modeling

  • A Deep Look at the CQL Where Clause - Blog post to describe what is supported by the CQL WHERE clause and the reasons why it differs from normal SQL.
  • Casandra Time Series Data Modeling for Massive Scale - Blog post discussing a common Cassandra data modeling technique called bucketing.
  • Scalar DBstars205 - Transaction library for Cassandra that makes non-ACID distributed databases/storages ACID-compliant.
  • General

    Cassandra Architecture

  • Curious Case of Tombstones - How someone dealt with tombstone issues and reclaimed space in their cluster.
  • General

    Cassandra Maintenance

  • Intro to CStar - Tutorial on how to use CStar.
  • General

    Cassandra Deployment

  • An Introduction to Cassandra Multi-Data Centers: Part 1 - Learn about how to plan and implement Multi-Data Centers: Part 1.
  • An Introduction to Cassandra Multi-Data Centers: Part 2 - Learn about how to plan and implement Multi-Data Centers: Part 2.
  • Databases

    Custom Time Series

  • kairosdb/kairosdbstars1.6k - Fast scalable time series database.
  • Cassandra Schema — KairosDB 1.0.1 documentation - KairosDB documentation.
  • OpenNMS/newtsstars187 - New-fangled Timeseries Data Store that powers OpenNMS.
  • Hawkular GitHub - Hawkular's GitHub resources.
  • Packages

    Tools

  • JetBrains Datagrip DB IDE - The Cross-Platform IDE for Databases & SQL by JetBrains, with support for Cassandra.
  • Cassandra SStable Toolsstars83 - Multiple different tools combined into one that helps admins get summaries, metadata, partition info, cell info.
  • Cassandra-Clientstars45 - Simple gui tool for browsing tables and data in Cassandra.
  • Zipkinstars14.8k - Distributed tracing system.
  • Instaclustr Java Driver for Kerberosstars4 - GSSAPI authentication provider for the Cassandra Java driver.
  • Instaclustr TTL Removerstars17 - Command line tool for rewriting SSTables to remove TTLs.
  • Instaclustr SSTable Generatorstars3 - CLI tool for programmatic generation of Cassandra SSTables.
  • Instaclustr Exporterstars51 - Java agent that exports Cassandra metrics to Prometheus.
  • Instaclustr Go Client for Instaclustr Icarusstars4 - Go client for Instaclustr Icarus sidecar.
  • Feb 12th

    General

    Cassandra Data Modeling

  • Cassandra Data Modeling Notes - Simple notes on how to estimate the size of your cluster.
  • Cassandra Data Modeling Best Practices Guide - Explains five Cassandra data modeling best practices.
  • General

    Cassandra Maintenance

  • Backup Strategies for Cassandra - Good comparison of different backup and restoration strategies for Cassandra.
  • Cassandra backup utilstars35 - Instaclustr's cassandra backup tool.
  • General

    Cassandra Performance Tuning

  • Analyzing Cassandra Performance with Flame Graphs - Visually examining Cassandra performance visually using Flamegraphs.
  • General

    Cassandra Deployment on Kubernetes / Kubernetized Cassandra

  • Sky UK - Cassandra Kubernetes Operatorstars23 - Kubernetes operator that manages Cassandra clusters inside Kubernetes. Well designed and organized.
  • Databases

    Monitoring / Metrics

  • filodb/FiloDBstars1.3k - Distributed Prometheus time-series database compatible with Prometheus queries.
  • cybem/cyanite-iowstars0 - Cassandra backed Carbon daemon and metric web service. IPONWEB repository, compatible with Carbon.
  • Packages

    Tools

  • cassandra-migration-tool-javastars96 - Cassandra migration tool for java is a lightweight tool used to execute schema and data migration on Cassandra database.
  • Presto - Distributed SQL Query Engine for Big Data. Presto allows querying data where it lives, including Hive, Cassandra, relational databases or even proprietary data stores.
  • Packages

    Open Source Applications

  • Cassandra-Toolsstars55 - Python Fabric scripts to help automate the launching and managing of cluster testing on AWS.
  • Packages

    Logging /Metrics

  • Cassandra CFStats to CSV Parserstars1 - Converts the output of CFStats to CSV.
  • Resources

    Books

  • Cassandra: The Definitive Guide, 2nd Edition
  • Resources

    Videos

  • Tuning the Spark Cassandra Connector - Great talk by Russell Spitzer maintainer of the Spark Cassandra connector.
  • Resources

    Slides

  • Tuning the Spark Cassandra Connector - Slides by Russell Spitzer maintainer of the Spark Cassandra connector.
  • Databases

    Graph

  • DSE Graph | Datastax - Successor to TitanDB , Commercial Tinkerpop / Gremlin compatible large scale Graph Database on DSE.
  • Jan 27th

    General

    Cassandra Deployment

  • Benchmarking Cassandra with Local Storage on Azure - Learn about comparing Cassandra on Azure VMs w/ Local vs. Remote storage.
  • General

    Spark

  • Spark + Cassandra Best Practices - Outlines general use cases and best practices of Spark & Cassandra together.
  • Jan 19th

    Packages

    Tools

  • DataStax OpsCenter - Simplified management for DataStax Enterprise and Cassandra database clusters.
  • dseansiblestars5 - DSE Installation and Upgrade Ansible Playbooks/Roles for Ubuntu Linux.
  • Packages

    Logging /Metrics

  • Cassandra Nagiosstars5 - Perl Based scripts to get metrics for monitoring using Jolokia.
  • Cassandra StatD Agentstars12 - Java Agent for Cassandra integration with StatsD.
  • Dec 9th, 2020

    Packages

    Tools

  • Instaclustr Minotaurstars4 - Command line tool for consistent rebuilding of a Cassandra cluster.
  • Aug 11th, 2020

    General

    Cassandra Deployment on Kubernetes / Kubernetized Cassandra

  • Strapdata - Elassandra Operator for Kubernetesstars9 - The Elassandra Kubernetes Operator automates the deployment and management of Elassandra clusters deployed in multiple Kubernetes clusters.
  • Apr 14th, 2020

    Feb 27th, 2020

    Packages

    Tools

  • ValuStorstars51 - ValuStor is a key-value pair database solution.
  • JanuesGraph-Utilsstars196 - Tool to Develop a graph database app.
  • Scylla-Migratorstars26 - Migrate data extract using Spark to Scylla, normally from Cassandra.
  • Cassandra CA Managerstars11 - Create and sign Java keystores.
  • Jul 29th, 2019

    General

    Cassandra Deployment on Kubernetes / Kubernetized Cassandra

  • Instaclustr - Kubernetes Operator for Cassandrastars223 - The Cassandra operator manages Cassandra clusters deployed to Kubernetes and automates tasks related to operating an Cassandra cluster.
  • Jul 19th, 2019

    General

    Cassandra

  • Apache Cassandra - Manage massive amounts of data, fast, without losing sleep.
  • General

    Cassandra Use Cases

  • Kaa application based on Raspberry Pi and DHT11 sensorstars0 - Cassandra IoT usecase with Raspberry Pi and a DHT11 Sensor.
  • Simple Node.js Express 4 Cassandra Applicationstars15 - MySubscribers is a very simple application (Start of an application) which allows you to create, read, update and delete users/subscribers. This application was only created to aid the YouTube course.
  • General

    Using Cassandra

  • Cassandra Data Copy Toolstars5 - Java tool to copy data from one cassandra table to another.
  • Import CSV files with sparkstars0 - How to import a file from S3 into cassandra using Spark.
  • Cloud DevOps with Cassandra - Using Packer, Ansible/SSH and AWS command line tools to create and DBA manage EC2 Cassandra instances in AWS.
  • General

    Cassandra from Relational

  • RDBMS to NoSQL - Your roadmap to understanding whether NoSQL is right for you.
  • General

    Cassandra Data Modeling

  • Basic Rules Of Cassandra Data Modeling - Picking the right data model is the hardest part of using Cassandra. If you have a relational background, CQL will look familiar, but the way you use it can be very different.
  • killrvideo-sample-schemastars14 - Sample Cassandra CQL Schema for a YouTube clone.
  • General

    Cassandra Security

  • Hardening Cassandra Step by Step: Part 1 - Inter-Node Encryption (And a Gentle Intro to Certificates).
  • General

    Cassandra Deployment on Docker / Containerized Cassandra

  • Docker Meet Cassandra. Cassandra Meet Docker - Article reviewing how to setup a complete Cassandra application with monitoring on Docker.
  • Cassandra & Zeppelin Notebook on Dockerstars3 - Docker-Compose script for Cassandra + Zeppelin setup.
  • General

    Spark

  • fluxcapacitor/pipelinestars4.1k - End-to-End, Real-time, Advanced Analytics Big Data Reference Pipeline using Spark, Spark SQL, Spark ML, GraphX, Spark Streaming, Kafka, NiFi, Cassandra, ElasticSearch, Redis, Tachyon, HDFS, Zeppelin, iPython/Jupyter Notebook, Tableau, Twitter Algebird.
  • General

    Search / Secondary Indexes

  • Tuning DSE Search - Tuning DSE Search – Indexing latency and query latency.
  • Cassandra Lucene Indexstars586 - Lucene based secondary indexes for Cassandra.
  • cassandra-triggerstars28 - Cassandra trigger to push realtime updates to elasticsearch.
  • Packages

    Libraries

  • express-cassandrastars180 - Cassandra ORM/ODM/OGM for Node.js with optional support for Elassandra & JanusGraph.
  • Packages

    Tools

  • cdeploystars8 - Cdeploy is a simple tool to manage your Cassandra schema migrations in the style of dbdeploy.
  • cql-vimstars37 - Cassandra CQL Syntax Highlighter for Vim.
  • Packages

    Open Source Applications

  • Cassandra Opstoolsstars53 - Generic scripts to review and monitor cassandra, from Spotify.
  • Cherami - Distributed, scalable, durable, and highly available message queue system.
  • Packages

    Logging /Metrics

  • cassandra-log4j-appenderstars19 - Cassandra appenders for Log4j.
  • Resources

    Documentation

  • DataStax Documentation - Documentation and Drivers from DataStax.
  • Resources

    Courses

  • DataStax Academy - Free online courses on Cassandra.
  • Resources

    Communities

  • Apache Software Foundation Slack - The #cassandra and #cassandra-dev channels are official slack channels migrating from IRC.
  • Stack Overflow: Cassandra
  • Stack Overflow: cql
  • Stack Overflow: spark-cassandra-connector
  • Jul 17th, 2019

    Packages

    Open Source Applications

  • Netflix-Priamstars1k - Co-Process for backup/recovery, Token Management, and Centralized Configuration management for Cassandra.
  • Oct 10th, 2018

    General

    Cassandra from Relational

  • Cassandra Schemas for Beginners (like me) - Great article for new developers to Cassandra.
  • Sep 21st, 2018

    Aug 7th, 2018

    General

    Cassandra Architecture

  • Hinted Handoff and GC Grace Demystified - Tuning the balance between GC Grace and Hinted Handoff.
  • Null bindings on prepared statements and undesired tombstone creation - Good follow up to the last article on Tombstones.
  • General

    Cassandra Maintenance

  • Running commands cluster-wide without any management tool - Some tips and tricks to do basic Cluster operations without tools like Chef, Ansible, or Salt.
  • Limiting Nodetool Parallel Threads - Little known tool to do nodetool operations with less resources.
  • Bootstrapping Cassandra Nodes - Indepth article on how to add nodes to a running Cassandra cluster.
  • Node Replacement without Bootstrapping - How to avoid the long bootstrapping process.
  • Cassandra Backup and Restore - Backup in AWS using EBS Volumes - Indepth article about Backup and recovery in AWS.
  • General

    Cassandra Performance Tuning

  • Jon Haddad: Cassandra Summit Recap - Diagnosing Problems in Production
  • Secret HotSpot option improving GC pauses on large heaps
  • Garbage Collection Tuning for Cassandra - Optimizing garbage collection for better performance.
  • TWCS part 1 - how does it work and when should you use it? - Best suited for time series data that expires, Time Window Compaction Strategy comes with some caveats.
  • Aug 3rd, 2018

    General

    Integrating with Cassandra

  • sample KafkaSparkCassandrastars23 - Introductory sample scala app using Apache Spark Streaming to accept data from Kafka and write a summary to Cassandra.
  • sample Spark Cassandra with SSLstars1 - Simple sample job illustrating the use of Spark to execute Apache Spark analytics with Cassandra with SSL connection.
  • General

    Spark

  • sample Spark Job Server Cassandrastars2 - Simple sample job illustrating the use of Spark Jobserver to execute Apache Spark analytics with Cassandra.
  • Packages

    Libraries

  • gocqlstars2.2k - Package gocql implements a fast and robust Cassandra client for the Go programming language.
  • Packages

    Tools

  • cqlmigratestars39 - Cassandra CQL migration tool. cqlmigrate is a library for performing schema migrations on a cassandra cluster.
  • Aug 2nd, 2018

    Packages

    Tools

  • CassanddraRestfulAPIstars10 - CassandraRestfulAPI project exposes the cassandra data tables with the help of Restful API.
  • Aug 1st, 2018

    General

    Cassandra Security

  • Encrypting EC2 ephemeral volumes with LUKS and AWS KMS - The example used here is Cassandra data stored on ephemeral disks.
  • General

    Cassandra Performance Tuning

  • Graphing cassandra-stress - Benchmarking schemas and configuration changes using the cassandra-stress tool, before pushing such changes out to production is one of the things every Cassandra developer should know and regularly practice.
  • Gatling DSE Stress Simulation Catalogstars4 - The goal of the repo is to provide a sample of the Gatling DSE Stress Framework's usage. Feel free to submit a pull request with example simulations.
  • General

    Cassandra Deployment on Docker / Containerized Cassandra

  • Example code from the Docker Meet Cassandra Articlestars81
  • General

    Cassandra Compliant Databases on C++

  • YugaByte Databasestars5.7k - YugaByteDB is a transactional, high-performance database for building distributed cloud services. It supports Cassandra-compatible and Redis-compatible APIs, with PostgreSQL in Beta.
  • Jul 26th, 2018

    General

    Cassandra Deployment on Docker / Containerized Cassandra

  • Packer: Cassandra Imagestars47 - Cassandra Image using Packer for Docker and EC2 AMI. Covers managing EC2 Cassandra clusters with Ansible.
  • Resources

    Communities

  • Datastax Academy Slack
  • Cassandra Slack
  • Quora: Cassandra
  • Meetups: Cassandra
  • General

    Using Cassandra

  • Spring Data Cassandra Examplesstars1 - Examples for the Spring Data Cassandra Project.
  • General

    Cassandra Data Modeling

  • Common Problems in Cassandra Data Models - Presentation and Article on wide partions, tombstones, and data skew.
  • Last Checked At: 2021-10-25T04:22:29.304Z
    Previous
    vaticle/typedb-awesome
    Next
    shime/creative-commons-media

    About

    Track your favorite github awesome repo, not just star it. trackawesomelist.com provides website, newsletter, RSS for tracking the popular awesome list by daily and weekly.
    Contact us: [email protected]
    Track Awesome List - Track your favorite Github awesome repos, not just star them | Product Hunt

    Subscribe

    Subscribe to our weekly newsletter to receive the awesome updates! We never send spam and you can unsubscribe instantly with one click. Here's past issues.

    Links

    Follow us on TwitterSubscribe us on TelegramSubmit awesome list repoNewsletterDonateSitemap