Data Science


A data visualization tool used to create interactive visual analytics in the form of dashboards and generate compelling business insights. These dashboards make it easier for non-technical analysts and end users to convert data into understandable, interactive graphics.

Time Series Analysis

A statistical technique that deals with time series data, or trend analysis. There are two main goals of time series analysis: (a) identifying the nature of the phenomenon represented by the sequence of observations, and (b) forecasting (predicting future values of the time series variable). 
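
As a minimal sketch of the two goals above, the following Python fits a straight-line trend to a series (describing its nature) and extrapolates it (forecasting). The function names `fit_linear_trend` and `forecast` are hypothetical, and a linear trend is only one assumption among many possible models; real series often need seasonal or autoregressive terms.

```python
def fit_linear_trend(values):
    """Least-squares line through (0, y0), (1, y1), ...; returns (slope, intercept)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    t_mean = (n - 1) / 2
    y_mean = sum(values) / n
    cov = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(values))
    var = sum((t - t_mean) ** 2 for t in range(n))
    slope = cov / var
    intercept = y_mean - slope * t_mean
    return slope, intercept


def forecast(values, steps_ahead=1):
    """Extend the fitted trend line past the last observation."""
    slope, intercept = fit_linear_trend(values)
    n = len(values)
    return [intercept + slope * (n - 1 + k) for k in range(1, steps_ahead + 1)]
```

Calling `forecast(series, steps_ahead=2)` returns the next two values implied by the fitted line.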

Torch AI

A deep learning library of algorithms based on the LuaJIT scripting language. Torch has a neural network library and comes with machine learning, computer vision, audio, image and video capabilities. It can be viewed as a scientific computing framework.


Vertica

A big data analytics platform with a distributed architecture and columnar compression for reliability and speed. Vertica is designed for use in big data workloads where speed, simplicity and scalability are essential. It offers SQL and geospatial analysis functions, machine learning models and Hadoop integration.


WEKA

Stands for Waikato Environment for Knowledge Analysis. WEKA is machine learning software that provides algorithms and tools for data analysis, visualization and predictive modeling, along with a user-friendly interface. It also supports pre-processing, clustering, regression, association rules and classification.


The architectural center of Hadoop that allows multiple data processing engines to handle data stored in a single platform. 

Machine Learning

A field of study that explores the capability of a computer to learn, akin to a human brain. It also studies the construction of algorithms that can learn from and make predictions on data. Its aim is to have a computer learn from data on its own rather than follow explicitly programmed rules.


A machine learning-based toolkit for the processing of natural language text. The toolkit provides support for the most common NLP tasks, such as language detection, parsing, tokenization, sentence segmentation, and more.


SAS

Stands for Statistical Analysis System. A data analysis tool for data management, data mining, statistical analysis, data warehousing and more. It also provides features for writing reports and developing business models and applications.

Data Science

An interdisciplinary study of information sources, what the information represents and ways of turning it into a valuable resource when creating business and IT strategies. It uses methods and techniques of statistics and data analysis to analyze and understand a phenomenon.

Deep Learning

A branch of machine learning based on a specific set of algorithms. These algorithms are called artificial neural networks and were designed to mimic a human brain’s structure and function. The algorithms can learn different levels of representation (abstraction) through classification and pattern analysis, among other methods.


NLP

Stands for Natural Language Processing. A field of computer science and artificial intelligence that studies ways for computers to analyze and understand human language and derive meaning from it, with an emphasis on how to program them to do so. The goal of NLP is to facilitate computer-human interactions.


RNN

Stands for Recurrent Neural Network. RNNs are a class of artificial neural networks commonly used with sequential data. They retain information from previous inputs because their connections form loops, which pass information from one step of the sequence to the next. RNNs can be used in conjunction with NLP methods and in machine translation.
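
The loop described above can be sketched in plain Python. This is an illustrative, untrained recurrence, not any particular library's API: the names `rnn_step` and `run_rnn`, the `tanh` activation and the weight layout are all assumptions made for the example.

```python
import math


def rnn_step(x, h_prev, w_xh, w_hh, b):
    """One recurrent step: the new hidden state mixes the current input
    with the previous hidden state, then applies tanh."""
    return [
        math.tanh(
            sum(w_xh[i][j] * x[j] for j in range(len(x)))
            + sum(w_hh[i][k] * h_prev[k] for k in range(len(h_prev)))
            + b[i]
        )
        for i in range(len(b))
    ]


def run_rnn(inputs, w_xh, w_hh, b):
    """Feed a sequence through the cell, carrying the hidden state forward."""
    h = [0.0] * len(b)  # initial hidden state (assumed to be zeros)
    states = []
    for x in inputs:
        h = rnn_step(x, h, w_xh, w_hh, b)
        states.append(h)
    return states
```

With a nonzero recurrent weight `w_hh`, the state after a zero input still reflects earlier inputs; setting `w_hh` to zero removes that memory.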
