作者:Vanessa Sochat
### 作者邮箱:vsochat@stanford.edu
### 首页:https://github.com/vsoch/cdb
### 文档:None
### 下载链接
# Container Database (cdb)
This is the Python support tool for containerdb to support generation of data containers. Python is more friendly to generating arbitrary data structures, and is popular among the data science community, so I chose it for metadata generation instead of using GoLang.
Have your data and use it too!
For documentation and full examples see vsoch.github.io/cdb. These examples are also available in the examples folder.
Getting Started
What is a Data Container?
A data container is generally an operating-system-less container that is optimized to provide data, either for query/search, or binding for analysis. The qualities of the data container should be:
It can be mounted to containers with operating systems to run analysis
It can be interacted with on it's own to search metadata about the data
It should not have an operating system.
How do we generate one?
The generation is fairly simple! It comes down to a three step multistage build:
Step 1 We install cdb to generate a GoLang template for an in-memory database for our data)
Step 2 We compile the binary into an entrypoint
Step 3 We add the data and the binary entrypoint to a scratch container (no operating system).
And then we interact with it! This tutorial will show you the basic steps to perform the multistage-build using a simple Dockerfile along with the data fol