parent | title | has_children | nav_exclude | nav_order |
---|---|---|---|---|
Getting Started | Running the Samples | false | false | 1 |
This guide is aimed at developers, contributors, and anyone else who wants to test the various modules that are available. We will start with the Data Lake Sample on AWS; from there, you can deploy any additional samples using the same procedure.
- Start by installing the required DataOps tools.
  - Go to https://docs.dataops.tk/setup.
  - Follow the provided installation steps, including at minimum: Git, Terraform, Docker, and Visual Studio Code.
- Clone the `dataops-infra` repo.
- Create a new file in the `.secrets` folder called `aws-credentials` and enter your AWS credentials.
  - For reference, see the sample file: `.secrets/aws-credentials.template`.
- In the left-hand navigation pane in VS Code, expand the `samples` folder and select the `data-lake-on-aws` sample.
- Right-click the desired sample folder (`data-lake-on-aws` in this example) and select "Open in Integrated Terminal".
- Run `terraform init` to download the needed providers, and then run `terraform apply` to deploy the infrastructure.
- Once again, in the left-hand navigation pane in VS Code, expand the `samples` folder and select the `data-lake-on-aws` sample.
- Right-click the desired sample folder (`data-lake-on-aws` in this example) and select "Open in Integrated Terminal".
- Run `terraform destroy` to tear down the environment.
  - When prompted, type 'yes' to confirm (Terraform accepts only the full word).
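The deploy and teardown steps above can also be run from any terminal rather than through the VS Code menu. The following is a minimal sketch, assuming it is run from the root of your `dataops-infra` clone; `tf_in_sample` is a hypothetical helper name used here for illustration and is not part of the repo.

```shell
# Run a Terraform command inside a sample folder.
# Usage: tf_in_sample <sample-name> <terraform-args...>
# Assumes the current directory contains the "samples" folder.
tf_in_sample() {
  sample_dir="samples/$1"
  shift
  if [ ! -d "$sample_dir" ]; then
    echo "Sample folder not found: $sample_dir" >&2
    return 1
  fi
  # Use a subshell so the caller's working directory is left unchanged.
  (cd "$sample_dir" && terraform "$@")
}

# Example usage (commented out because it deploys real, billable infrastructure):
#   tf_in_sample data-lake-on-aws init      # download the needed providers
#   tf_in_sample data-lake-on-aws apply     # deploy the infrastructure
#   tf_in_sample data-lake-on-aws destroy   # tear the environment down again
```

Wrapping the `cd` in a subshell keeps your terminal in the repo root, so the same helper can be pointed at a different sample folder without navigating back first.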
Once you have successfully deployed the data lake sample, you are ready to deploy more advanced infrastructures. This section will walk you through deploying the Kitchen Sink Sample on AWS.
Before you go further:
Some of these components require Python, so now is a good time to pause and double-check that Python and its installer, pip, are both working as expected. In any command line, try running `pip3 --version`. If that doesn't work, try `pip --version`. If that doesn't work either, please revisit the DataOps quickstart and follow the instructions to reinstall Python (after first uninstalling any versions you already have installed). Similarly, you can double-check that Terraform is installed by running `terraform --version` at any command prompt.
And one more thing: remember that you are deploying real infrastructure into the cloud, and those resources cost real money. Be sure to run `terraform destroy` to clean up your AWS environment after running each deployment. Alternatively, you can use an environment like LinuxAcademy Playground, which automatically cleans up your resources after a specified time limit.
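The version checks described above can be collected into one small script. This is only a sketch: `first_available` is a hypothetical helper name, and the messages it prints are illustrative rather than anything produced by the DataOps tooling.

```shell
# Print the name of the first command in the argument list that is on PATH.
# Returns non-zero if none of them is found.
first_available() {
  for cmd in "$@"; do
    if command -v "$cmd" >/dev/null 2>&1; then
      echo "$cmd"
      return 0
    fi
  done
  return 1
}

# Try pip3 first, then fall back to pip.
if pip_cmd=$(first_available pip3 pip); then
  "$pip_cmd" --version || echo "$pip_cmd is installed but failed to run" >&2
else
  echo "pip not found; revisit the quickstart to reinstall Python" >&2
fi

# Confirm that Terraform is installed as well.
if command -v terraform >/dev/null 2>&1; then
  terraform --version
else
  echo "terraform not found on PATH" >&2
fi
```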
Once you are ready to go:
- In the left-hand navigation pane in VS Code, expand the `samples` folder and select the `kitchen-sink-on-aws` sample.
- Right-click the `kitchen-sink-on-aws` sample folder and select "Open in Integrated Terminal".
- Run `terraform init` to download the needed providers, and then run `terraform apply` to deploy the infrastructure.
  - If needed, perform any necessary debugging until you succeed in running `terraform apply` without errors. At any time, especially if you get stuck on a particular module, you may still be able to run `terraform output`, which will give you an overview of the components that have deployed successfully.
- Your deployed infrastructure should look similar to the below:
- Important: Once you are done, run `terraform destroy` to destroy the infrastructure you have deployed.
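As an optional follow-up to the teardown step, you can confirm that nothing is left in the Terraform state of the sample folder. The sketch below relies on `terraform state list`, which prints one line per tracked resource; the helper names `count_tracked_resources` and `warn_if_deployed` are hypothetical and exist only for this example.

```shell
# Count the resources still tracked in the Terraform state of the current folder.
# Prints 0 when the state is empty or when no state exists yet.
count_tracked_resources() {
  terraform state list 2>/dev/null | grep -c . || true
}

# Print a reminder and return non-zero if anything is still deployed.
warn_if_deployed() {
  count=$(count_tracked_resources)
  if [ "$count" -gt 0 ]; then
    echo "$count resource(s) still tracked; run 'terraform destroy' to clean up"
    return 1
  fi
  echo "No resources tracked in state"
}

# Example usage, from inside a sample folder such as samples/kitchen-sink-on-aws:
#   warn_if_deployed
```

An empty state only tells you what Terraform itself is tracking, so it is still worth checking the AWS console if a deployment failed partway through.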
Navigate to the `samples` directory and select one of the samples. Each sample folder has a README file that explains its functionality and links to the README for each of the related catalog components.