Data Source: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
m (Text replacement - "<U>Example(s)</U>" to "Example(s)")
m (Text replacement - "<B>Examples:</B>" to "<B>Example(s):</B>")
 
(24 intermediate revisions by 4 users not shown)
Line 1: Line 1:
A [[Data Source]] is a source of a [[Data Set]].
A [[Data Source]] is a [[system component]] that provides [[data set]]s (to support [[data-driven process]]es and [[data-driven system]]s).
* <B>AKA</B>: [[Data Feed]].
* <B>Context:</B>
* <B><U>Context</U>:</B>
** It can generate [[Data Content]] through [[data generation mechanism]]s.
** It can range from being a [[Batch Data Source]] to being an [[Real-Time Data Source]] (such as a [[Real-Time Data Source]]).
** It can provide [[Data Access]] via [[access interface]]s.
* <B>Example(s):</B>  
** It can maintain [[Data Format]] through [[format specification]]s.
** the [[New York Stock Exchange Data Feed]], is a [[Data Stream]].
** It can ensure [[Data Quality]] via [[quality control]]s.
** the [[UCI ML Repository]].
** It can support [[Data Availability]] through [[delivery mechanism]]s.
** a [[Product Data Feed]].
** It can often include [[Data Schema]] with [[structural element]]s.
* <B><U>See</U>:</B> [[Data Collection Task]], [[Real-Time Computing]], [[Web Feed]], [[RSS Feed]], [[Product Feed]], [[E-Commerce]], [[Internet Marketing]].
** It can often enable [[Data Discovery]] through [[metadata service]]s.
** It can often support [[Data Integration]] via [[integration protocol]]s.
** It can often maintain [[Data Freshness]] through [[update mechanism]]s.
** ...
** It can range from being a [[Batch Data Source]] to being a [[Real-Time Data Source]], depending on its [[delivery mode]].
** It can range from being a [[Static Data Source]] to being a [[Dynamic Data Source]], depending on its [[update frequency]].
** It can range from being a [[Simple Data Source]] to being a [[Complex Data Source]], depending on its [[data structure]].
** It can range from being a [[Raw Data Source]] to being a [[Processed Data Source]], depending on its [[processing level]].
** It can range from being a [[Single Data Source]] to being a [[Federated Data Source]], depending on its [[distribution model]].
** It can range from being a [[Structured Data Source]] to being an [[Unstructured Data Source]], depending on its [[data format]].
** It can range from being a [[Point Data Source]] to being a [[Data Flow Source]], depending on its [[temporal continuity]].
** ...
** It can integrate with [[Data Consumer]] for [[data consumption]].
** It can connect to [[Data Pipeline]] for [[data flow]].
** It can support [[Data Repository]] for [[data storage]].
** ...
* <B>Example(s):</B>
** [[Enterprise Data Source]]s, such as:
*** [[Market Data System]]s, such as:
**** [[New York Stock Exchange Data Feed]] for [[financial trading]].
**** [[Bloomberg Data Feed]] for [[market analysis]].
*** [[Business Intelligence Source]]s, such as:
**** [[Sales Analytics Feed]] for [[performance tracking]].
**** [[Customer Data Platform]] for [[behavior analysis]].
** [[Research Data Source]]s, such as:
*** [[Academic Repository]]s, such as:
**** [[UCI ML Repository]] for [[machine learning]].
**** [[Real-World Dataset]] for [[empirical research]].
*** [[Clinical Source]]s, such as:
**** [[Clinical Trial Data]] for [[medical research]].
**** [[Healthcare Dataset]] for [[patient analysis]].
** [[Technology Data Source]]s, such as:
*** [[Stream System]]s, such as:
**** [[Real-Time Data Stream]] for [[live monitoring]].
**** [[Event Data Feed]] for [[system tracking]].
*** [[Sensor Network]]s, such as:
**** [[IoT Sensor Feed]] for [[device monitoring]].
**** [[Environmental Sensor]] for [[condition tracking]].
** [[Web Data Source]]s, such as:
*** [[Content Feed]]s, such as:
**** [[Web Feed]] for [[content syndication]].
**** [[RSS Feed]] for [[information distribution]].
*** [[E-Commerce Feed]]s, such as:
**** [[Product Data Feed]] for [[e-commerce]].
**** [[Product Offer Data Feed]] for [[retail distribution]].
** [[AI Training Source]]s, such as:
*** [[Model Training Data]]s, such as:
**** [[LLM Training Dataset]] for [[model training]].
**** [[Multimodal Dataset]] for [[AI development]].
*** [[Specialized Corpus]]es, such as:
**** [[DEFT Corpus]] for [[text analysis]].
**** [[Wikipedia Data Feed]] for [[information extraction]].
** [[Integration Source]]s, such as:
*** [[Enterprise Integration]]s, such as:
**** [[Integration Server System]] for [[data aggregation]].
**** [[AWS Data Pipeline]] for [[data orchestration]].
*** [[Data Lake Source]]s, such as:
**** [[Data Lake Feed]] for [[unified storage]].
**** [[Data Warehouse Stream]] for [[analytical processing]].
** [[Government Data Source]]s, such as:
*** [[Public Dataset]]s, such as:
**** [[Census Data Feed]] for [[demographic analysis]].
**** [[Open Government Data]] for [[public research]].
*** [[Regulatory Feed]]s, such as:
**** [[Compliance Data Stream]] for [[regulatory reporting]].
**** [[Policy Update Feed]] for [[governance tracking]].
** [[Social Data Source]]s, such as:
*** [[Social Media Feed]]s, such as:
**** [[Social Network Stream]] for [[engagement analysis]].
**** [[User Generated Content Feed]] for [[trend analysis]].
*** [[Community Data]]s, such as:
**** [[Forum Data Stream]] for [[discussion analysis]].
**** [[Review Platform Feed]] for [[sentiment tracking]].
** [[Weather Data Source]]s, such as:
*** [[Meteorological Feed]]s, such as:
**** [[Weather Station Data]] for [[condition monitoring]].
**** [[Satellite Data Feed]] for [[atmospheric analysis]].
*** [[Climate Data]]s, such as:
**** [[Historical Climate Feed]] for [[trend analysis]].
**** [[Forecast Data Stream]] for [[prediction modeling]].
** ...
* <B>Counter-Example(s):</B>
** [[Computation Resource]], which processes rather than provides [[data]].
** [[Data Sink]], which consumes rather than provides [[data]].
** [[Data Cache]], which temporarily stores rather than originates [[data]].
** [[Data Proxy]], which forwards rather than generates [[data]].
* <B>See:</B> [[Data Provider]], [[Data Origin]], [[Data System]], [[Data Collection Task]], [[Real-Time Computing]], [[E-Commerce]], [[Internet Marketing]].
 
----
----
----
----
==References==
 
== References ==
 
=== 2024 ===
* (Wikipedia, 2024) ⇒ https://en.wikipedia.org/wiki/Data_feed Retrieved:2024-12-23.
** '''Data feed''' is a mechanism for users to receive updated data from data sources. It is commonly used by [[real-time computing|real-time]] applications in [[point-to-point (telecommunications)|point-to-point]] settings as well as on the [[World Wide Web]]. The latter is also called [[web feed]]. News feed is a popular form of web feed. [[RSS feed]] makes dissemination of [[blog]]s easy. [[Product feed]]s play increasingly important role in [[e-commerce]] and [[internet marketing]], as well as [[news distribution]], [[financial markets]], and [[cybersecurity]].  Data feeds usually require structured data that include different labelled fields, such as "title" or "product".


=== 2014 ===
=== 2014 ===
* (Wikipedia, 2014) &rArr; http://en.wikipedia.org/wiki/data_feed Retrieved:2014-2-10.
* (Wikipedia, 2014) http://en.wikipedia.org/wiki/data_feed Retrieved:2014-2-10.
** '''Data feed''' is a mechanism for users to receive updated data from data sources. It is commonly used by [[real-time computing|real-time]] applications in [[point-to-point (telecommunications)|point-to-point]] settings as well as on the [[World Wide Web]]. The latter is also called [[web feed]]. News feed is a popular form of web feed. [[RSS feed]] makes dissemination of [[blog]]s easy. [[Product feed]]s play increasingly important role in [[e-commerce]] and [[internet marketing]]. Data feeds usually requires structured data. At the present time unstructured data, e.g. HTML pages, dominate the Web. As a result, data feeds have the potential to make a bigger impact on the Web in the future.
** '''Data feed</B> is a mechanism for users to receive updated data from data sources. It is commonly used by [[real-time computing|real-time]] applications in [[point-to-point (telecommunications)|point-to-point]] settings as well as on the [[World Wide Web]]. The latter is also called [[web feed]]. News feed is a popular form of web feed. [[RSS feed]] makes dissemination of [[blog]]s easy. [[Product feed]]s play increasingly important role in [[e-commerce]] and [[internet marketing]]. Data feeds usually requires structured data. At the present time unstructured data, e.g. HTML pages, dominate the Web. As a result, data feeds have the potential to make a bigger impact on the Web in the future.


----
----
__NOTOC__
[[Category:Concept]]
[[Category:Concept]]
__NOTOC__
[[Category:Data Systems]]
[[Category:Quality Silver]]

Latest revision as of 23:05, 28 December 2024

A Data Source is a system component that provides data sets (to support data-driven processes and data-driven systems).



References

2024

2014