Database

Resources

Data Lake vs Data Warehouse

Data Lake

  • A data lake is a system or repository of data stored in its natural/raw format, usually object blobs or files

  • It is usually a single store of data, including raw copies of source system data, sensor data, social data, etc.

  • A data lake can include structured data from relational databases (rows and columns), semi-structured data (CSV, logs, XML, JSON), unstructured data (emails, documents, PDFs) and binary data (images, audio, video)

  • A data lake can be established "on premises" (within an organisation's data centres) or "in the cloud" (using cloud services from vendors such as Amazon, Microsoft, or Google)
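
As a rough illustration of the "store it raw" idea above, the sketch below uses a temporary local directory as a stand-in for a data lake. The function names (`land_raw`, `demo_lake`), the source names, and the file names are all hypothetical; a real lake would normally sit on object storage rather than a local folder.

```python
import tempfile
from pathlib import Path


def land_raw(lake_root: Path, source: str, filename: str, payload: bytes) -> Path:
    """Write payload byte-for-byte under <lake_root>/<source>/ without parsing it."""
    target_dir = lake_root / source
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / filename
    target.write_bytes(payload)
    return target


def demo_lake() -> list[str]:
    """Land one object of each kind and return the relative paths now in the lake."""
    with tempfile.TemporaryDirectory() as tmp:
        lake = Path(tmp)
        # Rows exported from a relational table
        land_raw(lake, "crm", "customers.csv", b"id,name\n1,Ada\n")
        # Semi-structured event record
        land_raw(lake, "web", "events.json", b'{"user": 1, "action": "login"}')
        # Unstructured free text
        land_raw(lake, "support", "ticket-42.txt", b"The printer is offline again.")
        # Binary data (only the first four bytes of a PNG signature, as a placeholder)
        land_raw(lake, "sensors", "frame.bin", bytes([0x89, 0x50, 0x4E, 0x47]))
        return sorted(p.relative_to(lake).as_posix() for p in lake.rglob("*") if p.is_file())


if __name__ == "__main__":
    print(demo_lake())
```

Nothing is parsed or validated on the way in; every format lands side by side, and interpretation is left to whoever reads the files later.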

Data Warehouse

Blobs

  • BLOB stands for Binary Large OBject. A blob is a data type that can store binary data

  • This is different from most other data types used in databases, such as integers, floating point numbers, characters, and strings, which store letters and numbers

  • A BLOB is a large, complex collection of binary data stored in a database

  • BLOBs are typically used to store media files such as images, video, and audio

  • Because BLOBs hold multimedia files, they can take up a large amount of disk space

  • The length of a BLOB may go up to 2,147,483,647 bytes (2^31 - 1); the exact limit depends on the database system

  • BLOB provides fast multimedia transfer
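
To make the BLOB data type concrete, here is a minimal sketch using Python's built-in `sqlite3` module with an in-memory database. The table name `media`, its columns, and the `blob_roundtrip` helper are assumptions made for illustration, and the payload is a handful of placeholder bytes rather than a real image.

```python
import sqlite3


def blob_roundtrip(payload: bytes) -> bytes:
    """Store payload in a BLOB column and read it back unchanged."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE media (id INTEGER PRIMARY KEY, name TEXT, content BLOB)"
        )
        # Passing a bytes object makes sqlite3 store the value as a BLOB.
        conn.execute(
            "INSERT INTO media (name, content) VALUES (?, ?)", ("logo.png", payload)
        )
        row = conn.execute(
            "SELECT content FROM media WHERE name = ?", ("logo.png",)
        ).fetchone()
        return row[0]
    finally:
        conn.close()


if __name__ == "__main__":
    # The 8-byte PNG signature, used here only as sample binary data.
    sample = bytes([0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A])
    stored = blob_roundtrip(sample)
    print(stored == sample, len(stored))
```

The bytes come back exactly as written, including values that are not valid text, which is the practical difference from character and string columns.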

ETL vs ELT Approach

ETL Approach

ELT Approach

  • Extract, load, transform (ELT) is an alternative to extract, transform, load (ETL) used with data lake implementations

  • The data is not transformed on entry to the data lake but stored in its original raw format, which enables faster loading times

  • ELT requires sufficient processing power within the data processing engine to carry out the transformation on demand and return the results in a timely manner

  • Since the data is not processed on entry to the data lake, the query and schema do not need to be defined a priori
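
The ELT flow described above can be sketched as follows, assuming an in-memory SQLite table as a stand-in for the lake and a few made-up JSON records. The names `raw_events`, `load_raw`, `total_per_user`, and `run_elt` are hypothetical. Records are loaded as-is, and parsing happens only when a question is asked of the data.

```python
import json
import sqlite3

# Made-up raw records; note that the second one lacks a "currency" field.
RAW_EVENTS = [
    '{"user": "ada", "amount": "12.50", "currency": "EUR"}',
    '{"user": "bob", "amount": "7.25"}',
    '{"user": "ada", "amount": "3.00", "currency": "EUR"}',
]


def load_raw(conn, lines):
    """Extract + load: store each record exactly as received, with no parsing."""
    conn.execute("CREATE TABLE IF NOT EXISTS raw_events (payload TEXT)")
    conn.executemany(
        "INSERT INTO raw_events (payload) VALUES (?)", [(line,) for line in lines]
    )


def total_per_user(conn):
    """Transform on demand: parse the raw payloads only when this query runs."""
    totals = {}
    for (payload,) in conn.execute("SELECT payload FROM raw_events"):
        record = json.loads(payload)
        user = record["user"]
        totals[user] = totals.get(user, 0.0) + float(record["amount"])
    return totals


def run_elt():
    """Load the sample records, then compute per-user totals at query time."""
    conn = sqlite3.connect(":memory:")
    try:
        load_raw(conn, RAW_EVENTS)
        return total_per_user(conn)
    finally:
        conn.close()


if __name__ == "__main__":
    print(run_elt())
```

Because no schema is fixed at load time, the record without a `currency` field loads without complaint. The cost is that every query pays for parsing, which is why ELT depends on a capable processing engine.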

Data Pipeline

Lambda Architecture

Big Data


Different Processing Types

Stream Processing

Batch Processing

References for Further Reading

If a link does not open, try opening it in an incognito window
