Proposal: separate alexa request/response from http server

Question

Proposal: separate alexa request/response from http server

harrisonhjones opened this issue 7 years ago · comments

Found this project the other night. Looks great. Any thoughts about separating the http server components and the Alexa request/response into separate packages? I'm looking at developing my Alexa skills using a Lambda Go shim so I don't need the HTTP server part of this package but I like the SSML parts.

Also, I don't see a LICENSE for this package. How are you licensing it?

Mike Flynn · Answer 1 · Wed Oct 04 2017 10:23:43 GMT+0800 (China Standard Time)

This has come up a few times before but I haven't had a chance to dig in a refactor it to the point they are separated.

As for license, the skillserver dir originally had a GNU license file, but I just moved it to MIT with the file in the project root so GitHub picks it up.

Harrison Jones · Answer 2 · Wed Oct 04 2017 22:47:43 GMT+0800 (China Standard Time)

I'm happy to attempt the refactor but I'd like to coordinate it ahead of time. Here's my proposal:

New /customskill directory in root of go-alexa. Imported with github.com/mikeflynn/go-alexa/customskill. Inside of this directory:
- Move all request and respond related types to a types.go
- Move echo.go in here and split it up into cards, request, & response go files.
- Remove mentions of Echo in method calls as they are not needed
- Tempted to change things like RepromptSSML to SetRepromptSSML as it's clearer
- (Stretch) Write tests
New /ssml directory in root of go-alexa. Move SSML builder here
Refactor skill server to use the above

Thoughts?

Mike Flynn · Answer 3 · Fri Oct 06 2017 11:21:46 GMT+0800 (China Standard Time)

This is definitely along the same lines I've been thinking from the start (which is why it's been nested under go-alexa).

The project breakout makes sense and most of these moves should workable without too much logic changing, which is good. I'd hate to change too much on people in one shot as the breaking out of the projects is already a big change.

Anyone else want to weigh on this?

Harrison Jones · Answer 4 · Fri Oct 06 2017 11:56:45 GMT+0800 (China Standard Time)

Proposal for migration:

Add the new customskill and ssml directory and COPY code as needed
Update skillserver main logic to use these new packages
Mark duplicate code in skillserver as deprecated (From GoDoc: "To signal that an identifier should not be used, add a paragraph to its doc comment that begins with "Deprecated:" followed by some information about the deprecation.")
Don't contribute any new code to deprecated packages/types

That should be pretty painless for endusers. If they need the new features they can refactor but until they do it will continue to work.

Rob · Answer 5 · Fri Oct 06 2017 21:30:31 GMT+0800 (China Standard Time)

I think all of that sounds reasonable and a useful change for future development.

Mike Flynn · Answer 6 · Sun Oct 08 2017 05:08:10 GMT+0800 (China Standard Time)

Sounds good to me. It's only one package now, so the vast majority of people are just using the skillserver as a single piece, meaning they would probably not even know the difference, but since Go isn't big on versions, this is safest way to go.

Harrison Jones · Answer 7 · Sun Oct 08 2017 22:13:33 GMT+0800 (China Standard Time)

Great. I'm happy to start the refactoring but I'm a bit unsure the best way to send in a PR. I prefer small PRs to make them easy to review but I also don't want to commit unfinished code to mainline. How about I fork the repo, develop on a development branch, issue a PR against a development branch on this repo and then, once it's all stable (in a few PRs) issue a PR to bring it into mainline and remove the development branch? Thoughts?

Mike Flynn · Answer 8 · Mon Oct 09 2017 11:02:10 GMT+0800 (China Standard Time)

Yup, development branch is definitely the way to go for this.

I might have some free time later this week so let me know if you need any help, but this should be pretty straightforward I think.

Rob · Answer 9 · Mon Oct 09 2017 21:39:35 GMT+0800 (China Standard Time)

FYI i'm also working on dialog support in this PR, #21

i'll be submitting a new commit to add the type safety as discussed. I would imagine this would need to be refactored into possibly a dialog.go file as part of this work after its merged.

Mike Flynn · Answer 10 · Tue Oct 10 2017 11:43:07 GMT+0800 (China Standard Time)

Yup, we'll have to have a little air traffic control to get both of these projects landed, but lets just see which is done first and go from there.

Harrison Jones · Answer 11 · Wed Nov 15 2017 00:15:15 GMT+0800 (China Standard Time)

Hey @mikeflynn (and anyone else @rking788?) I'm working on the refactor for the custom skills request / response and I wanted y'all's input. As I have it now I've renamed EchoRequest (the high-level request struct) to Envelope (not married to the name) and built a custom JSON unmarshaller for it. You hand it the incoming question to your skill and you get an Envelope back. This Envelope has the following signature:

type Envelope struct {
	Version string  `json:"version"`
	Session Session `json:"session"`
	Context Context `json:"context"`
	Request interface{} `json:"request"`
}

The part I want feedback on is what I've done with the Request. I wanted to do away with our single Request type that has lots of fields that aren't used for any particular request (all request types smooshed into one) so Request is an interface{} and its populated automatically by the unmarhsaller with the appropriate *Request type. Unfortunately that means that you have to do some fancy work to actually read the data. You need to use reflection to check the type and then cast it to that type. Depending on how you implement your custom skill I don't think this is too much of a big deal. You simple do something like switching off of the type reflect returns and then go from there. Thoughts?

Rob · Answer 12 · Wed Nov 15 2017 00:22:07 GMT+0800 (China Standard Time)

Would it make sense to provider specific getters for the different request types that do the type assertion/casting and returns either that request type or an error if Request is not of that type?

Just seems like that would be convenient for cases where the client usually knows what type of request should be of a specific type but also leaves flexibility for clients to handle the Request property more dynamically if they want.

Harrison Jones · Answer 13 · Wed Nov 15 2017 00:36:10 GMT+0800 (China Standard Time)

@rking788 I don't think there is a way to avoid the casting by the client (if you can figure out a way I'd appreciate a code snippet). What we could do it something like Envelope.GetType which handles the reflection for the client and then all they need to do is perform the cast.

I don't think clients usually do know what the type of an incoming request is ahead of time. How would they (or maybe I am misunderstanding?)

Rob · Answer 14 · Wed Nov 15 2017 00:44:12 GMT+0800 (China Standard Time)

I think i need to go back and look at the Alexa request structure docs later tonight. Is the plan or the Request to be a custom struct type? or will it be more like a map[string]interface{}. I guess i assumed when you said they would switch off the type reflect that there was a predefined set of Request structs that could possibly be returned. The way things are defined now, isn't the EchoRequest type just a union of all the fields that are returned in the different request types?

Harrison Jones · Answer 15 · Wed Nov 15 2017 00:49:55 GMT+0800 (China Standard Time)

Is the plan or the Request to be a custom struct type? ... I guess i assumed when you said they would switch off the type reflect that there was a predefined set of Request structs that could possibly be returned.

Yes, that's what I was thinking. I've got request types for IntentRequest, SessionStart, etc

The way things are defined now, isn't the EchoRequest type just a union of all the fields that are returned in the different request types?

Yep. That's how it is now and while it works it's not ideally (imo) because it exposes a number of fields that have nothing to do with the request. Ideally clients should only be presented with an object that is 100% relevant to the request that is being made.

So you'd do something like

// inside of your switch's case "IntentRequest"
request, ok := envelope.Request.(intentRequest)

if !ok {
panic("Oh no, my switch statement is messed up")
}

// request is a intentRequest

Harrison Jones · Answer 16 · Thu Nov 16 2017 08:04:58 GMT+0800 (China Standard Time)

@rking788 I was thinking about this and I don't actually think the cast is needed by the client. We could provide something like the official alexa-sdk (for NodeJS) and let the client register handlers for each request type. These handlers would already take the specific *Request object as a parameter.

Top of my head I imagine something like this:

skill = customskill.New(w io.Writer)

skill.SetIntentRequestHandler(func(Request.IntentRequest) error)

and when a new request came in you would call skill.Handle(envelope string) which would parse the envelope and automatically call the appropriate registered handler.

Thoughts?

Rob · Answer 17 · Thu Nov 16 2017 08:34:04 GMT+0800 (China Standard Time)

that sounds like it could work. so the current skillserver does something similar to this I think. The EchoApplication struct already allows you to specify different handlers for OnLaunch, OnIntent, and OnSessionEnded. Would the plan be to register handlers based on intentName? or refine these different handlers to take specialized request types instead of their generic EchoRequest all of the handlers currently take.

I was looking into this some more and maybe we could do something similar to the type switch described on this page: https://github.com/golang/go/wiki/Switch

I think if there are specialized request type handlers then the next step is just to switch on the intent name though.

Mike Flynn · Answer 18 · Sun Nov 19 2017 15:02:49 GMT+0800 (China Standard Time)

Coming in late on this, but the handler for each type of request is in there and I think that's the best way to do it. I think API interactions like this can be easily overthought but the best way to go is to make the user (developer) clear about what they are trying to do (specifying a specific handler, etc) and then offload a much complexity as possible in to the library.

I agree the "all-in-one" request object was getting large, but it did allow the handlers to not have to worry about the input as much. The library started off with a single handler, but now that it's broken out to multiple handlers you could have each one expect a different request type without adding much complexity to the handler developer...which sounds exactly where you both ended up!

As for the "Envelope" name, fair enough that the "EchoRequest" name ended up not making much sense in relation to how Amazon evolved the product, but any new name should include "Request" in the name just so it's appropriately descriptive.

Harrison Jones · Answer 19 · Mon Nov 20 2017 13:08:01 GMT+0800 (China Standard Time)

I agree the "all-in-one" request object was getting large, but it did allow the handlers to not have to worry about the input as much. The library started off with a single handler, but now that it's broken out to multiple handlers you could have each one expect a different request type without adding much complexity to the handler developer...which sounds exactly where you both ended up!

I agree. Let me think about a way to address this properly and submit a PR for it. I obviously need to take another look at the existing logic before moving forward.

As for the "Envelope" name, fair enough that the "EchoRequest" name ended up not making much sense in relation to how Amazon evolved the product, but any new name should include "Request" in the name just so it's appropriately descriptive.

I'm on board with this. How about something like RequestEnvelope ? Alternately, and I'm not sure if I like this, we could treat the incoming request as two things: metadata (version & session) and the actual request and then pass those two things to the handlers: a new metadata object (doing away with the envelope entirely) and the actual request. Thoughts?

Harrison Jones · Answer 20 · Mon Nov 27 2017 01:32:33 GMT+0800 (China Standard Time)

@mikeflynn & @rking788 new PR for your review/thoughts: #28

Mike Flynn · Answer 21 · Thu Nov 30 2017 06:34:26 GMT+0800 (China Standard Time)

I'm back from vacation and catching up!

RequestEnvelope works for me. Not sure about the breaking things out at first blush. I'd need to see it in action...and I'm headed to your PR next so I'm guessing I'll see it there!

Rob · Answer 22 · Thu Nov 30 2017 10:52:13 GMT+0800 (China Standard Time)

still getting back into things after vacation too. taking a look at the PR now.

Harrison Jones · Answer 23 · Mon Jan 01 2018 07:25:16 GMT+0800 (China Standard Time)

With #28 merged into the refactor branch I thought we could talk about the remaining work to get this branch pushed to mainline/master. As per @mikeflynn's comments near the end of #28 I think there are a few different types of examples we need to provide: usage examples and skill examples. I've listed what I've come up with below.

Usage Examples

These examples illustrate how to use the customskill package in various forms. These are absolutely required to merge the refactor branch into mainline. I'm unsure where these examples should "live" so I would appreciate feedback @mikeflynn. Perhaps go-alexa/customskill/examples/usage/?

A complete app that works like skillserver does today, in which the dev has no existing app and just wants a server from nothing to respond to Alexa requests.
An Alexa Skill app that is easily integrated in to an existing go web application.
An existing web application that wants to cherry pick a subset of the features (maybe the security checking) but then handle the raw request itself.

I also suggested:

A example AWS Lambda Alexa skill

Skill Examples

These example illustrate how to build skills with different features (audio player, dialog, etc). All examples should come with SMAPI (https://developer.amazon.com/alexa-skills-kit/smapi) skill definitions, instructions on how to deploy and test them, and any assets required to run them. I don't think these are 100% required for the merger of the refactor branch but by implementing them we should discover any issues with the code before release and therefor make it easier to maintain the Go compatibility promise.

I can test this skill on an Echo Show and Echo Spot.

Playback Skill
- Use the PlaybackController Interface

This skill might be a bit hard to test as I am not sure which devices on the market make use of this interface. Perhaps the Echo Show and Echo Spot w/ on-screen controls?

Video Skill
- Use the VideoApp Interface

I can test this skill on an Echo Show and Echo Spot.

Stretch Skills

We should also consider skills which implement the following Alexa features:

Sending a Progressive Response
Modifying the customer's Shopping & To-Do Lists
Getting the device location

Note, some of these "stretch" skill ideas require calls to the Alexa API. We should consider creating a new sub-package for accessing this API.

Finally

Assuming we (@mikeflynn, @rking788, + anyone else :) ) all agree with the above I would appreciate some help hammering each example out. I'm happy to break it out into individual issues, assign owners (if anyone wants to take ownership of a particular task), and get it done. Let me know what y'all think.

Rob · Answer 24 · Wed Jan 03 2018 00:46:33 GMT+0800 (China Standard Time)

That plan sounds reasonable to me. I have never worked with SMAPI before so i'll had to look into how that works. Is it possible to refactor the existing example in the repo to work for the "base Alexa skill" example?

I would just recommend that any Lambda shim examples maybe be a low priority since official support is coming at some point, it may not be worth the trouble to build out the workaround example.

Harrison Jones · Answer 25 · Fri Jan 05 2018 01:30:23 GMT+0800 (China Standard Time)

The existing example at customskill/examples/http shows how to create a skill using go's standard http server. I would prefer we keep that one very simple. It should be updated so at least it works (and has some install instructions) but I won't build it out much.

I'm imaging the following folder structure:

customskill
- examples
  - usage
    - http
      - http example (existing) + instructions
    - lambda (future)
      - example + instructions
  - skills
    - basic
      - example + instructions
    - audioplayer
      - example + instructions
    - etc...

Does that work for everyone?

Akshay Kayastha · Answer 26 · Mon Dec 17 2018 13:04:35 GMT+0800 (China Standard Time)

@harrisonhjones @mikeflynn , been using the refactor branch. Has been really useful!
Although I need to add support for CanFullfilIntentRequest. I could try and submit a PR too.

Things, however, seem a little slow here. Any way I can help?

Mike Flynn · Answer 27 · Wed Jan 02 2019 11:25:05 GMT+0800 (China Standard Time)

Yeah, this refactor kept getting larger and then things stalled. Any help in additional changes, lists of what's needed, or testing would be appreciated!