Big data
Large volumes of complex and rapidly accumulating data sets.
Definitions:
An amount of data that is beyond the ability of typical software tools to capture, manage, and process efficiently.
Cluster computing
The use of multiple computers to accomplish a task or operation.
Definitions:
A form of computing in which a group of computers is linked together so that the group can perform tasks more efficiently than a single machine.
Resilient Distributed Dataset (RDD)
A data structure in Apache Spark.
Definitions:
An immutable distributed collection of objects that can be processed in parallel.
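The idea of an immutable collection that is split into partitions and processed in parallel can be sketched in plain Python. This is a toy stand-in, not Spark's API: the class name ToyDataset and its methods from_items, map, and collect are hypothetical, and threads on one machine stand in for the nodes of a cluster.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyDataset:
    """Toy immutable, partitioned collection (hypothetical; not Spark's RDD API)."""

    def __init__(self, partitions):
        # Store every partition as a tuple so it cannot be modified later.
        self._partitions = tuple(tuple(p) for p in partitions)

    @classmethod
    def from_items(cls, items, num_partitions=2):
        items = list(items)
        # Ceiling division gives the number of items per partition.
        size = max(1, -(-len(items) // num_partitions))
        return cls(items[i:i + size] for i in range(0, len(items), size))

    def map(self, fn):
        # Each partition is transformed in its own worker thread.
        # The result is a NEW dataset; this one is left untouched.
        with ThreadPoolExecutor() as pool:
            return ToyDataset(
                pool.map(lambda part: [fn(x) for x in part], self._partitions)
            )

    def collect(self):
        # Gather all partitions back into one list, in order.
        return [x for part in self._partitions for x in part]


numbers = ToyDataset.from_items(range(1, 7), num_partitions=3)
squares = numbers.map(lambda x: x * x)

print(numbers.collect())  # the original data is unchanged
print(squares.collect())  # the transformed data lives in a new dataset
```

Because map returns a new object instead of editing the existing one, any number of workers can read the same partitions without coordinating with each other.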
DAG
Acronym for Directed Acyclic Graph.
Definitions:
A concept in mathematics and computer science that refers to a graph in which every edge has a direction and no path of edges leads from an item back to itself.
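A small example may make the "directed" and "acyclic" parts concrete. The sketch below uses Python's standard graphlib module (Python 3.9 or later); the step names load, filter, map, and reduce are made up for illustration, and the helper is_dag is hypothetical.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical pipeline: each key lists the steps it depends on.
dependencies = {
    "load": set(),
    "filter": {"load"},
    "map": {"filter"},
    "reduce": {"map"},
}


def is_dag(graph):
    """Return True if the directed graph contains no cycle."""
    try:
        # Iterating static_order() raises CycleError if a cycle exists.
        list(TopologicalSorter(graph).static_order())
        return True
    except CycleError:
        return False


# A valid execution order exists only because the graph has no cycle.
order = list(TopologicalSorter(dependencies).static_order())
print(order)

# Making "load" depend on "reduce" closes a loop, so this is not a DAG.
cyclic = {**dependencies, "load": {"reduce"}}
print(is_dag(dependencies), is_dag(cyclic))
```

The ordering step is what makes a DAG useful for scheduling: every step can run after the steps it depends on, which is impossible once a cycle is present.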
Immutability
The state of being unchangeable.
Definitions:
A property of a data structure or data type whose contents, once created, cannot be changed.
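As a minimal illustration, Python's built-in tuple is an immutable type: an attempt to change an element fails, and a "change" is expressed by building a new value. The variable names here are arbitrary.

```python
point = (3, 4)  # a tuple cannot be modified after creation

try:
    point[0] = 10  # in-place assignment is rejected
except TypeError:
    changed = False
else:
    changed = True

# To "change" the data, create a new tuple and leave the original as it is.
moved = (10, point[1])

print(changed)
print(point, moved)
```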