Building your first Action for Google Home (in 30 minutes)

Jonathan Eisenzopf
Google Cloud - Community
9 min readApr 5, 2017

Developing Actions on Google Series Overview

This is the first in a series of articles that I’ll be writing where we will learn how to develop and deploy Actions on Google. Actions power the Google Home and Google Assistant and allow users to interact with your application via voice or text. The goal is to make this process fun and easy for any developer level. Actions can currently be used on Google Home devices or in the Google Assistant app on Google Pixel phones. This tutorial will focus on the voice interface.

We will be developing a simple random room generator called Three Doors. If you want to interact with it now and you have your Google Home nearby, simply say, “Hey Google I want to talk to Crazy House”.

In this first article, the goal is to get you up and running in as little time as possible so that you can run your Action in test mode. In future articles, we will dive deeper into the code, learn how to test and deploy your action, and use other tools like API.ai. If you have any problems or suggestions for future articles, reach out to me @eisenzopf on twitter. For now, time is ticking so let’s get started.

Note: We will be using the gaction command line interface (CLI) throughout this article, because it’s faster and scriptable. Every action in this article using the CLI has a corresponding equivalent from the Google Console.

Pre-requisites

First, I’m assuming that you have some basic technical skills and can work around Linux or OS X.

To make this work, all you’ll need is:

  • A Google Home device
  • Linux/Unix/OS X with Bash, curl, Node.js, git, and npm (bundled with Node.js)
  • Python 2.7 or Later
  • Java 1.7 or later
  • a Google Gmail account (recommended but not required)

If you don’t have a Google Home device and/or Node.js, see the Reference section below. If you are running Windows 10, see the reference section below for installing Bash and curl on your system. Note for Windows 10 users: remove the sudo anytime you see it in this tutorial as it is not needed.

Note: Once you finish this article, your first stop should be the Design Checklist by my good friend Nandini Stocker, one of the best voice user interface (VUI) designers in the business. You must follow the design guidelines in order for your Google Action to be approved. We will talk more about deploying your application in a future article.

How Google Actions work

Without getting too detailed, the way a Google Home device works is that it captures your voice intent on the Google Action Platform and translates this to text.

  1. The Google Action Platform attempts to match the Intent to an Action in the directory and then to an Action endpoint. As a developer, when you’re ready to deploy an action, there is a Google registration process where you register your action intent name, like Three Doors. The Google Action Platform then know what Action to send the request to. Once registered, the Google Action Platform makes requests to your server side application via an HTTP POST request and sends the user’s input via a JSON document.
  2. Your application decodes the user’s inputs in a Node.js or other server side application, and sends back a JSON response, which will play content back to the user or ask for input. In the case above, we’re prompting the user to pick a door.
  3. The users says three which is sent to Google, translated to text, and sent on to the server side application, which takes the input and responds with another JSON document, which could be the next selection or end the dialog.

Create a directory for your code

It doesn’t really matter where you put it and I know it’s obvious, but start by creating a new directory for our code:

mkdir three-doors
cd three-doors

Clone three-doors from GitHub

Now you can clone the code for this tutorial from GitHub:

git clone https://github.com/eisenzopf/google-action-three-doors.git
cd google-action-three-doors

The key source code files are:

action.json.bak, index.js, package.json, and the files in the json/ directory

Install Node.js packages

Next run:

npm install

This should install all of the Node.js package dependencies that you need for the project.

Setting up your Google Action Skill

Now that we have an application environment setup, we can setup our environment for creating a Google skill.

Google cloud setup

If you already have a Google cloud account with billing setup, you can skip this step. Otherwise, you will need to visit https://cloud.google.com/ and create a new account. You can set up a trial account for free and at the time of this writing, Google is offering a $300 credit that can be used over 12 months. Simple.

Once you go through the setup, the system will drop you into the Google Cloud Console.

Install the gcloud CLI and create a new Google Action project

Now that you have a Google Cloud account, we are going to install the Google Cloud CLI called gcloud on our local machine and use it to create a new Google Action project.

1. To install the gcloud CLI, we must first install the Google Cloud SDK. There are several different installation options, which are all documented at https://cloud.google.com/sdk/downloads. The easiest way is to run the following commands:

curl https://sdk.cloud.google.com | bash
exec -l $SHELL
gcloud init

You will be asked a number of questions that are straightforward and once done, the gcloud will be ready to go.

2. Now we will create a new Google Action project using the gcloud CLI from a terminal window from within your project directory. Run:

gcloud alpha projects create three-doors-xxxx

Note: Replace xxxx with four random numbers of your choosing.

This will create a new project within your Google Cloud account. This is an alpha command and may change. You can alternatively create a new project in the Google Cloud Console GUI.

3. To confirm the project was created, run:

gcloud projects list

You should see your newly created project on the list in addition to a default project that was created when you ran cloud init command.

4. Change current project to the created project by running:

gcloud config set project three-doors-xxxx

Note: Replace xxxx with the four numbers you chose above.

5. Set billing for the project. Make sure that a billing account is associated with the project by visiting

https://console.developers.google.com/project/three-doors-xxxx/settings where xxxx is the 4 numbers you chose when creating the project.

6. Next, we need to add the Google Action API to the project. This can be done from the Google Cloud Platform Console at https://console.cloud.google.com.

  • From the console, at the very top left of the page to the right of the Google Cloud Platform logo, you will see two drop-down menus. Click on the second one that may say My First Project
  • Select the option labeled View more projects.
  • Select the project you just created from the gcloud CLI (three-doors-xxxx) and click on the OPEN button.
  • In the middle of the page click on the link labeled Go to APIs overview.
  • This page shows you which Google APIs have been enabled, but we need to add one so click the +ENABLE API link near the top middle of the page.
  • In the search box, type Action and you should see Google Actions API. Click on it.
  • At the top middle of the new page, click ENABLE.
  • Go back to the search box and type Function and you should see Google Cloud Functions API. Click on it.
  • At the top middle of the new page, click ENABLE.

The Google Actions API and Functions API are now enabled for your new project in the Google Cloud environment.

7. Lastly, create a Google Cloud Storage Bucket to store your Google Functions:

gsutil mb gs://three-doors-xxxx

Note: Replace xxxx with the four numbers you chose above.

This will provision a new Storage Bucket to host your code on the Google Cloud. You should be able to see the newly created bucket by running the command gsutil ls

Install and initialize gaction CLI

Now its time to get gaction environment set up. This is a command line tool you’ll run from a terminal that deploys your action.json file. This is the file that, once published and approved by Google, enables Google Assistant enabled devices such as Google Home or Allo on Android or iOS to access your Google Action application.

1. First point your browser to https://developers.google.com/actions/tools/gactions-cli and download the gactions command-line tool.

Shortcut from a terminal window, in your project directory, you can download directly using curl.

On Linux:

curl -O https://dl.google.com/gactions/updates/bin/linux/amd64/gactions/gactions

On Mac OS X:

curl -O https://dl.google.com/gactions/updates/bin/darwin/amd64/gactions/gactions

2. Next change permissions for gactions to executable:

chmod +x gactions

3. Within your project directory, initialize the tool inside a terminal window:

./gactions init

This will create a default action.json file in your project directory. Now we need to edit it.

Edit the action.json

Open action.json in your favorite editor.

Change the versionLable to 1.0 and change the projectId to three-doors-xxxx (replace with the project id you used when creating the project with gcloud CLI)

{
"versionLabel": "1.0",
"agentInfo": {
"languageCode": "en-US",
"projectId": "three-doors-xxxx",
"voiceName": "male_1"
},

When edited, it should look something like the above.

Next we’re going to edit actions section of the action.json file, which contains the list of possible thinks that the application will recognize. Every Google Assistant action.json will have an initialTrigger section. This is the action that will always be called first, so it’s also the most important one to get correct.

In this initial action.json file that was created with the actions init command, there are two initialTrigger sections. That is because an application can be called in multiple ways. We’re not going to worry about that in our first application so delete the second initialTrigger section.

The actions portion of our action.json file should now look similar to the following:

"actions": [
{
"description": "Launch intent",
"initialTrigger": {
"intent": "assistant.intent.action.MAIN"
},
"httpExecution": {
"url": "https://us-central1-three-doors-xxxx.cloudfunctions.net/three_doors"
}
}
]

Note: Replace xxxx with the four numbers you chose above.

Deploy your application to Google Cloud Functions

gcloud beta functions deploy three_doors --trigger-http --stage-bucket gs://three-doors-xxxx

Note: Replace xxxx with the four numbers you chose above.

This will deploy your Google Function, which is contained in the index.js source file (we’ll talk about that next time), into the Google Cloud. If this is the first time you’ve deployed the application, it will take some time; as much as 5 minutes. Don’t worry, updates don’t take nearly as long so it’s worth the wait of not having to manage your own servers.

Deploy your action.json test file

Next, it’s time to deploy your Action file (action.json). Use the command below. If this is the first time running the command, you may see instructions to enter a security token.

./gactions preview --action_package action.json --invocation_name "three doors" --preview_mins 1234

Running the gactions command above will turn on your new Action temporarily. Only you will be able to test it. This allows you to test the application in the command-line or web simulator or on your Google Home device. The google email you used to set up your cloud project will need to be the same account that manages your Google Home device.

Test your action

./gactions simulate

In the simulator, type:

talk to three doors

This should start the dialog.

When you run the command, there is also an option to start a web based simulator that will play the audio. You may be prompted to enable audio on your google gmail account.

Congratulations!

You just set up and deployed your first Action on Google. I’ve included a list of references below. In the next article in the series, we will look at the code and understand more how it works.

Please share this article if you found it useful or message me on twitter @eisenzopf

References

--

--

Jonathan Eisenzopf
Google Cloud - Community

Founder of Talkmap. Focused on #ai #machinelearning #cognitive #neural #psychology #sociology #nlu #nlp #semantics #cx