Data Ingestion Task
Jump to navigation
Jump to search
A Data Ingestion Task is data processing task that gets data from its source to its intended destination for a data-driven system.
- Context:
- It can produce a Data Ingestion Pipeline System.
- It can range from being a Batch Data Ingestion Task to being a Streamed Data Ingestion Task.
- …
- See: ETL Task, Data Ingestion Pipeline.
References
2020
- https://www.stitchdata.com/resources/data-ingestion/
- QUOTE: ... Data ingestion is the transportation of data from assorted sources to a storage medium where it can be accessed, used, and analyzed by an organization. The destination is typically a data warehouse, data mart, database, or a document store. Sources may be almost anything — including SaaS data, in-house apps, databases, spreadsheets, or even information scraped from the internet. …
2017
- (Meehan et al., 2017) ⇒ John Meehan, Cansu Aslantas, Stan Zdonik, Nesime Tatbul, and Jiang Du. (2017). “Data Ingestion for the Connected World.” In: CIDR.
- QUOTE: Data ingestion is the process of getting data from its source to its home system as efficiently and correctly as possible. This has always been an important problem and has been targeted by many previous research initiatives, such as data integration, deduplication, integrity constraint maintenance, and bulk data loading. Data ingestion is frequently discussed under the name of Extract, Transform, and Load (ETL). …