magda

This is a guide to deploying a highly-available version of magda for production use, similar to what’s used on search.data.gov.au. Be aware this is heavily coupled to our providers and costs roughly $1000/month. If you want something lighter, look at just using the defaults in https://github.com/magda-io/magda-config and modifying them as you go.

What you’ll be using if you run Magda like this:

Google Kubernetes Engine
Google Cloud SQL
AWS Route53
Mailgun

Create a new Google Kubernetes Engine cluster - for high availability you’ll want to spread this over at least 3 nodes and at least two zones. How many nodes you’ll need depends on how many resources you want to set in the magda config - if you use the development settings you only need 2 CPUs, if you use the search.data.gov.au settings you’ll need 10-12. You can always use autoscaling to get you the right number of nodes, but be aware that it tends to love scaling up but hates scaling down.
Set up a Google Cloud SQL database: https://cloud.google.com/sql/docs/mysql/connect-kubernetes-engine. Make sure you set that up for HA too.
Install cert-manager and a cert issuer: https://github.com/magda-io/magda/tree/master/deploy/helm/magda-cert-issuer
Go to the Google Cloud Platform console, and in VPC Network / External IP addresses, reserve yourself an IP address
In AWS Route53, point your desired domain at that address
Fork and follow the instructions at https://github.com/magda-io/magda-config, but first copy paste the values from https://github.com/magda-io/magda-config-data.gov.au. Make sure to change:

global.externalUrl: What’s the external url that you’ll be deploying the website on?
gateway.auth.x: Put the ids of your google/facebook apps for OAuth if you have them. You’ll also need to create an oauth-secrets secret containing a facebook-client-secret and google-client-secret.
connectors.config: Put the external connectors you want to use here
includeInitialJobs: Set this to true if you want the system to be immediately populated with external data
ingress.ipName: Should be the name of the ip you reserved in the Google Cloud console
ingress (others): Make sure you’ve put your hostname and any other domains you want to get a let’s encrypt cert for here.
correspondence-api.defaultRecipient: This should be a general email
gateway.ckanRedirectionDomain: If you’re replacing a CKAN instance set the domain here, otherwise leave it out.
gateway.csp: Put your CSP overrides in here and take out SDGA’s.
cloud-sql-proxy.instanceConnectionName: Needs to be the name of your Google Cloud SQL instance

Install MAGDA:

helm install --name magda deploy/helm/magda -f <path to your config file>

At this point everything should start up. It’ll probably take a while to get an actual working system with data (in particular loading regions for spatial search takes ages). Raise an issue if it doesn’t work!