Difference between revisions of "Data-centric knowledge"

From PKC
Knowledge is considered to be a derived property from data asset collected under the context of [[MU]] data operations. Every piece of knowledge needs to go through the following stages to be given a representable handle for ongoing integration of knowledge content:
# Grounding Raw Data: This data set is collected from widely deployed user terminals or certified data sensors that should always be annotated with timestamps and spatial tags that explicitly specify who, when and where the data are being collected. These raw data content, especially the timestamps and location/account that provided the data will be used as a reference to determine the authenticity of data.
# Inferred information: The ordering and semantic implication of the data content would further define the information content supported by [[Grounding Raw Data]]. This computational procedure is defined by a set of computational inference tools. For example, using [[Bayesian Belief Network]], or [[Trained Neural Network]], they will infer the probability distribution of certain events, therefore present information content beyond raw data. However, the authenticity of these decision algorithms and their training data sets must be verified as a part of the version control system of [[MU]] certified [[PKC]].
# Inferred information: The ordering and prioritization of information content is filtered by previously mentioned raw data. This information filtering procedure is conducted by a set of computational inference tools, whose source code are version-controlled based on [[MU]]-compliant rules. Computational procedures specified using Neural network, Bayesian Belief Networks, System Dynamic models, and other data-intensive inference mechanisms will have the input data content be part of the version-controlled data asset.
# Action of Acknowledgement: Is a set of causal relations that are written into actionable or executable programs/contracts. An action of acknowledgement can be automatically triggered by verified raw data and programmatically computed information content, including semi-automatically acknowledged by human-in-the-loop authorization of action. The event of acknowledgement can be represented as a piece of authenticated data that possess pragmatic value, such as a token of appreciation, honor badges, or cash payment.



Revision as of 09:32, 18 February 2022

Data-centric knowledge is a formalized mapping of concepts to data points. Its universal applicability rests on the representability assumption associated with Kan extensions: that all concepts and idealized knowledge are representable through functors from a domain of complex data types to uniquely identifiable data entries in set-theoretic form. On this view, knowledge of any kind can be stored or represented as concrete data points in databases.

Data-Centric Knowledge under the context of MU

Knowledge is considered a property derived from data assets collected in the context of MU data operations. Every piece of knowledge must pass through the following stages to be given a representable handle for ongoing integration of knowledge content:

  1. Grounding Raw Data: This data set is collected from widely deployed user terminals or certified data sensors. It should always be annotated with timestamps and spatial tags that explicitly specify who collected the data, and when and where it was collected. This raw data content, especially the timestamps and the location or account that provided the data, is used as a reference for determining the authenticity of the data.
  2. Inferred information: Information content is ordered and prioritized by filtering it against the raw data described above. This filtering is carried out by a set of computational inference tools whose source code is version-controlled under MU-compliant rules. For computational procedures specified using neural networks, Bayesian belief networks, system dynamics models, and other data-intensive inference mechanisms, the input data is also part of the version-controlled data asset.
  3. Action of Acknowledgement: A set of causal relations written into actionable or executable programs or contracts. An action of acknowledgement can be triggered automatically by verified raw data and programmatically computed information content, or semi-automatically through human-in-the-loop authorization. The event of acknowledgement can itself be represented as a piece of authenticated data that possesses pragmatic value, such as a token of appreciation, an honor badge, or a cash payment.
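
The three stages above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not an MU or PKC implementation: the names `RawRecord`, `fingerprint`, `infer` and `acknowledge`, the use of a SHA-256 content hash as a stand-in for a version-control identifier, and the mean-over-threshold rule are all hypothetical choices made for the example.

```python
import hashlib
import json
from dataclasses import dataclass, asdict
from typing import List, Optional


@dataclass(frozen=True)
class RawRecord:
    """Stage 1: a grounded observation tagged with who, when and where."""
    account: str    # who provided the data
    timestamp: str  # when (ISO 8601 string)
    location: str   # where (hypothetical free-text spatial tag)
    value: float    # the observed quantity


def fingerprint(obj) -> str:
    """Content hash, used here as a stand-in for a version-control identifier."""
    payload = json.dumps(obj, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def infer(records: List[RawRecord], tool_version: str) -> dict:
    """Stage 2: derive information and pin it to its inputs and tool version."""
    values = [r.value for r in records]
    return {
        "mean_value": sum(values) / len(values),
        "tool_version": tool_version,
        "input_hash": fingerprint([asdict(r) for r in records]),
    }


def acknowledge(info: dict, threshold: float,
                approved_by: Optional[str] = None) -> Optional[dict]:
    """Stage 3: emit an acknowledgement record when the rule is satisfied.

    `approved_by` models optional human-in-the-loop authorization.
    Returns None when the inferred value does not reach the threshold.
    """
    if info["mean_value"] < threshold:
        return None
    return {
        "token": "honor_badge",
        "approved_by": approved_by,
        "evidence_hash": fingerprint(info),
    }


if __name__ == "__main__":
    records = [
        RawRecord("alice", "2022-02-18T09:32:00Z", "site-A", 1.0),
        RawRecord("bob", "2022-02-18T09:33:00Z", "site-B", 3.0),
    ]
    info = infer(records, tool_version="v0.1")
    print(acknowledge(info, threshold=1.5, approved_by="reviewer"))
```

Because the acknowledgement record carries a hash of the inferred information, which in turn carries a hash of its raw inputs, each stage can be traced back to the data that grounded it.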

Related Pages