Data Deployment

Data Deployment is about orchestrating hardware and software resources to ensure that data services are accessible to their intended users. In the context of Data Governance, Data Deployment is one of the four modules and focuses on the operational and project-management aspects of data asset management.

Industrial-Strength Data Deployment Solutions

An important consideration in Data Deployment is that many commercial solutions and free/open-source technologies already exist and can be leveraged to standardize the process of deploying data services. However, due to economic and political concerns, there are many policy- and price-sensitive considerations that must be made explicit. The Data Deployment module should therefore present these concerns as a resource lookup table, or a best-practice repository, so that known solutions can be adopted without reinventing the wheel.
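
As a sketch of what such a lookup table might contain, the following Python snippet models a small best-practice repository keyed by deployment concern. The entries, field names, and license/pricing labels are illustrative assumptions, not an actual PKC inventory.

  from dataclasses import dataclass

  @dataclass
  class Solution:
      """One known deployment option and its policy- and price-sensitive attributes."""
      name: str
      license: str   # e.g. "Apache-2.0" or "proprietary"
      pricing: str   # e.g. "free" or "subscription"
      notes: str

  # Hypothetical best-practice repository: deployment concern -> known solutions.
  BEST_PRACTICES = {
      "provisioning": [
          Solution("Vagrant", "open source (verify current terms)", "free", "local test environments"),
          Solution("Kubernetes", "Apache-2.0", "free; managed offerings vary", "container orchestration"),
      ],
      "continuous integration": [
          Solution("Jenkins", "MIT", "free", "self-hosted CI server"),
      ],
      "visualization": [
          Solution("Apache Superset", "Apache-2.0", "free", "dashboards over SQL databases"),
      ],
  }

  def lookup(concern: str):
      """Return the known solutions recorded for a deployment concern."""
      return BEST_PRACTICES.get(concern, [])

  if __name__ == "__main__":
      for s in lookup("provisioning"):
          print(f"{s.name}: {s.license}, {s.pricing} - {s.notes}")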

Deploying Services

To set up a fully functional data pipeline, the following pages provide operational guidance (a minimal launch sketch in Python follows the list):

  1. Testing on Vagrant
  2. Installing Jenkins
  3. Installing Docker and Kubernetes
  4. Launch PKC Data Services
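
As a rough sketch of the final step, the snippet below uses the Docker SDK for Python (the docker package) to launch a database container and a MediaWiki container, roughly the shape of a PKC stack. The image tags, container names, ports, and credentials are assumptions made for illustration, not the actual PKC configuration; see Launch PKC Data Services for the authoritative procedure.

  import docker

  # Connect to the local Docker daemon (assumes Docker is installed and running).
  client = docker.from_env()

  # Private network so the wiki container can reach the database by name.
  client.networks.create("pkc-net", driver="bridge")

  # Database backing the wiki; image tag and credentials are placeholders.
  client.containers.run(
      "mariadb:10.5",
      detach=True,
      name="pkc-db",
      network="pkc-net",
      environment={
          "MYSQL_ROOT_PASSWORD": "change-me",
          "MYSQL_DATABASE": "my_wiki",
          "MYSQL_USER": "wikiuser",
          "MYSQL_PASSWORD": "change-me-too",
      },
  )

  # MediaWiki front end exposed on the host; the PKC image and port may differ.
  client.containers.run(
      "mediawiki:1.35",
      detach=True,
      name="pkc-wiki",
      network="pkc-net",
      ports={"80/tcp": 8080},
  )

  print("Running containers:", [c.name for c in client.containers.list()])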

Visualizing Data

  1. Tools such as Apache Superset[1] can be very useful for building dashboards on top of deployed data services (see the sketch below).
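
As a hedged example of working with Superset programmatically, the snippet below logs in to Superset's REST API and lists existing dashboards. The host, port, and credentials are assumptions, and the /api/v1 endpoints are provided by more recent Superset releases than the 0.35.1 version targeted by the cited tutorial.

  import requests

  SUPERSET_URL = "http://localhost:8088"  # assumed host and port

  # Authenticate against the Superset REST API to obtain a bearer token.
  login = requests.post(
      f"{SUPERSET_URL}/api/v1/security/login",
      json={"username": "admin", "password": "admin",
            "provider": "db", "refresh": True},
  )
  login.raise_for_status()
  token = login.json()["access_token"]

  # List existing dashboards with the token.
  resp = requests.get(
      f"{SUPERSET_URL}/api/v1/dashboard/",
      headers={"Authorization": f"Bearer {token}"},
  )
  resp.raise_for_status()
  for dashboard in resp.json().get("result", []):
      print(dashboard.get("dashboard_title"))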

References

  1. Tutorial - Creating your first dashboard, https://apache-superset.readthedocs.io/en/0.35.1/tutorial.html