Open Data Hub Studies
Summary
The goal of Open Data Hub (ODH) is to provide open source AI tools for running large, distributed AI workloads on OpenShift Container Platform, while the AI Library provides ML models as a service on OpenShift.
In general, an AI workflow includes most of the steps shown in the figure below:
For data storage and availability, ODH provides Ceph, which supports multiple protocols: block, file, and S3 object APIs. Ceph serves both as persistent storage within containers and as a scalable object storage data lake where AI applications can store and access data.
Notes
[Ceph](https://docs.ceph.com/docs/master/start/intro/) delivers a self-managed, self-scaling, and self-healing storage infrastructure using a storage cluster. A key Ceph architectural tenet is to have no single point of failure (SPoF) in the system.
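Because Ceph exposes an S3-compatible object API, an application can usually reach the data lake with a standard S3 client. The following is a minimal sketch, assuming the third-party `boto3` package is installed, that the bucket already exists, and that you have an endpoint URL and credentials for the Ceph object gateway. The function names `make_client` and `roundtrip` are hypothetical and chosen only for illustration.

```python
"""Round-trip an object through an S3-compatible endpoint such as Ceph's."""


def make_client(endpoint_url, access_key, secret_key):
    """Build a boto3 S3 client pointed at a custom (non-AWS) endpoint."""
    # boto3 is third-party; it is imported lazily so that roundtrip()
    # below can be exercised with any object offering the same methods.
    import boto3

    return boto3.client(
        "s3",
        endpoint_url=endpoint_url,
        aws_access_key_id=access_key,
        aws_secret_access_key=secret_key,
    )


def roundtrip(client, bucket, key, data):
    """Upload bytes to bucket/key, read them back, and return what was read."""
    # Assumption: the bucket already exists on the object store.
    client.put_object(Bucket=bucket, Key=key, Body=data)
    response = client.get_object(Bucket=bucket, Key=key)
    return response["Body"].read()
```

A call might look like `roundtrip(make_client(url, access_key, secret_key), "datasets", "train.csv", payload)`, where the endpoint URL, credentials, bucket, and key are placeholders for values from your own deployment.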
Installation on OpenShift
The latest version of the Open Data Hub operator project is located here: https://gitlab.com/opendatahub/opendatahub-operator
Install Ceph with the Rook operator using these instructions.
The configurations are available on GitHub: https://github.com/rook
To validate pods
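One way to check pod status programmatically is sketched below. It is an illustration rather than the project's documented procedure: it assumes the `oc` CLI is installed and logged in to the cluster, and it treats a pod as healthy when its phase is `Running` or `Succeeded`, which does not guarantee that every container in the pod is ready. The helper names `fetch_pods` and `pods_not_running` are hypothetical.

```python
import json
import subprocess


def pods_not_running(pod_list):
    """Return names of pods whose phase is neither Running nor Succeeded."""
    healthy = {"Running", "Succeeded"}
    return [
        item["metadata"]["name"]
        for item in pod_list.get("items", [])
        if item.get("status", {}).get("phase") not in healthy
    ]


def fetch_pods(namespace):
    """Ask the cluster for the pods in a namespace via the oc CLI."""
    result = subprocess.run(
        ["oc", "get", "pods", "-n", namespace, "-o", "json"],
        check=True,           # raise if oc exits with a non-zero status
        capture_output=True,  # collect stdout so it can be parsed as JSON
        text=True,
    )
    return json.loads(result.stdout)
```

Calling `pods_not_running(fetch_pods(namespace))` returns an empty list when every pod in that namespace is in a healthy phase. The namespace depends on how Rook and Ceph were installed, so substitute the one used in your cluster.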