Databases

Spark SQL

A Spark module for structured data processing. It provides a programming abstraction called DataFrames and can also act as a distributed SQL query engine. It enables unmodified Hadoop Hive queries to run up to 100x faster on existing deployments and data.

JSONB

A PostgreSQL data type that converts a text-format JSON object into a binary format. It removes unnecessary whitespace and duplicate keys and sorts the keys. Because of this pre-processing, JSONB is slower to write and usually takes somewhat more storage than JSON, but it is faster to query.
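The normalization steps described above can be imitated in a few lines of Python. This is only a sketch of the idea, not how JSONB is implemented: the helper name `normalize_json` is hypothetical, the result is still text rather than a binary format, and keys are sorted alphabetically here, which may differ from the ordering a real database uses internally.

```python
import json


def normalize_json(text: str) -> str:
    """Imitate JSONB-style pre-processing on a JSON object given as text."""
    # json.loads keeps only the last value when a key is repeated,
    # so duplicate keys disappear at this step.
    parsed = json.loads(text)
    # sort_keys orders the keys; compact separators drop extra whitespace.
    return json.dumps(parsed, sort_keys=True, separators=(",", ":"))


raw = '{ "b": 1,   "a": 2, "b": 3 }'
normalized = normalize_json(raw)
print(normalized)  # {"a":2,"b":3}
```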

ClickHouse

A fast, open-source, column-oriented database management system that generates analytical data reports in real time using SQL queries.

ScyllaDB

A big data database designed as a drop-in alternative to Apache Cassandra. It powers applications with ultra-low latency and extremely high throughput.

Key-Value Database

A type of nonrelational database that uses a simple key-value method to store data. A key-value database stores data as a collection of key-value pairs in which a key serves as a unique identifier.
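As a rough illustration of the key-value model, the sketch below wraps a Python dictionary in a tiny in-memory store. The class `KeyValueStore` and its method names are hypothetical and chosen only for this example; real key-value databases add persistence, replication, and much more.

```python
from typing import Any, Dict


class KeyValueStore:
    """A toy in-memory store: each unique key maps to exactly one value."""

    def __init__(self) -> None:
        self._data: Dict[str, Any] = {}

    def put(self, key: str, value: Any) -> None:
        # Writing to an existing key replaces its value.
        self._data[key] = value

    def get(self, key: str, default: Any = None) -> Any:
        # Lookups go straight to the key; there is no query language.
        return self._data.get(key, default)

    def delete(self, key: str) -> bool:
        # Returns True if the key existed and was removed.
        if key in self._data:
            del self._data[key]
            return True
        return False


store = KeyValueStore()
store.put("user:1", {"name": "Ada"})
store.put("user:1", {"name": "Grace"})  # same key, value is overwritten
print(store.get("user:1"))
print(store.get("user:2", "not found"))
```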

PipelineDB

PipelineDB is an open-source relational database that runs SQL queries continuously on streams, incrementally storing results in tables.

RDBMS

Stands for relational database management system. An RDBMS is a collection of programs and capabilities that enable IT teams and others to create, update, administer, and otherwise interact with a relational database.

Database administration

Database administration is the function of managing and maintaining database management system (DBMS) software.

Apache Druid

Druid is a column-oriented, open-source, distributed data store written in Java. Druid is designed to quickly ingest massive quantities of event data and to provide low-latency queries on top of that data.

Aerospike

Aerospike is a flash-optimized in-memory open source NoSQL database.

AllegroGraph

AllegroGraph is a database and application framework for building Semantic Web applications. It can store data and meta-data as triples; query these triples through various query APIs like SPARQL and Prolog; and apply RDFS++ reasoning with its built-in reasoner.

CrateDB

CrateDB is a distributed SQL database management system that integrates a fully searchable document-oriented data store.

DBMS

A database management system (DBMS) is the software that interacts with end users, applications, and the database itself to capture and analyze data.

Azure Cosmos DB

Azure Cosmos DB is a fully managed database service with turnkey global distribution and transparent multi-master replication. 

ArangoDB

An open source NoSQL database with a flexible data model for documents and graphs. It is designed as a “general purpose database,” offering all the features typically needed for modern web apps.

AWR

Automatic Workload Repository is a built-in repository in every Oracle Database. The database periodically takes snapshots of its vital statistics and workload information and stores them in the AWR.

Berkeley DB

An embedded database system for key/value data. It is easy to use, supports concurrent access by multiple users, provides industrial-strength transaction support, and recovers databases after system and disk failures.

Bigtable

A proprietary high-performance database built on Google File System (GFS), Chubby Lock Service, and some other Google products. The software itself is not distributed outside Google, although it is offered as a managed service, Cloud Bigtable, on Google Cloud.

Cassandra

A distributed NoSQL database management system designed to provide highly scalable and reliable storage for huge data sets, with high availability and no single point of failure.

CockroachDB

A distributed SQL database for cloud services. CockroachDB is built on a consistent key-value store, scales horizontally, and is designed to survive datacenter, disk, or machine failures with minimal disruption thanks to replication and automated repair. It is better suited to multi-datacenter and multi-region deployments, cloud migrations, cloud-native infrastructure, and replicated or distributed OLTP (online transaction processing) than to heavy analytics.

Couchbase

A distributed NoSQL engagement database that supports multiple data models. Couchbase offers flexibility, high performance, and on-demand scalability for both mobile and cloud-based web applications. It also aims to reduce operations, software, and hardware costs.

CouchDB

An open-source NoSQL document-oriented database optimized for interactive applications. It provides a fast key-value store for JSON documents and focuses on ease of use.

CRUD

Create, Read, Update, Delete. The acronym CRUD refers to the four basic operations implemented in relational database applications. CRUD is also sometimes used to describe user interface conventions that facilitate viewing, searching, and changing information, often through computer-based forms and reports.
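The four operations map directly onto SQL statements. The sketch below runs each of them against an in-memory SQLite database using Python's standard `sqlite3` module; the `users` table and the sample names are made up for illustration.

```python
import sqlite3

# An in-memory database that disappears when the connection closes.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")

# Create: INSERT a new row.
cursor = conn.execute("INSERT INTO users (name) VALUES (?)", ("Ada",))
user_id = cursor.lastrowid

# Read: SELECT the row back.
query = "SELECT name FROM users WHERE id = ?"
name_after_create = conn.execute(query, (user_id,)).fetchone()[0]

# Update: change the stored value.
conn.execute("UPDATE users SET name = ? WHERE id = ?", ("Grace", user_id))
name_after_update = conn.execute(query, (user_id,)).fetchone()[0]

# Delete: remove the row.
conn.execute("DELETE FROM users WHERE id = ?", (user_id,))
remaining = conn.execute("SELECT COUNT(*) FROM users").fetchone()[0]

conn.close()
print(name_after_create, name_after_update, remaining)  # Ada Grace 0
```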

CSV

Comma-separated values. A simple plain-text file format used to store tabular data, with one record per line and fields separated by commas. CSV is supported by a huge number of tools, from spreadsheets like Excel, OpenOffice and Google Docs to complex databases.
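A small sketch of writing and reading tabular data as CSV with Python's standard `csv` module follows. The column names and rows are illustrative, and an in-memory buffer stands in for a file on disk.

```python
import csv
import io

# newline="" lets the csv module control line endings itself.
buffer = io.StringIO(newline="")

writer = csv.writer(buffer)
writer.writerow(["name", "description"])
writer.writerow(["ClickHouse", "column-oriented"])
# A field containing a comma is quoted automatically by the writer.
writer.writerow(["Druid", "column-oriented, distributed"])

csv_text = buffer.getvalue()
print(csv_text)

# Read the same text back; each row becomes a dict keyed by the header.
buffer.seek(0)
rows = list(csv.DictReader(buffer))
print(rows[1]["description"])  # column-oriented, distributed
```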

Data Architecture

The data architecture defines the data along with the schemas, integration, transformations, storage, and workflow required to enable the analytical requirements of the information architecture.
