Expose user-defined meta-information via introspection API in form of directives

Question

Expose user-defined meta-information via introspection API in form of directives

OlegIlyenko opened this issue 7 years ago · comments

With growing popularity of IDL as a way to define a GraphQL schema, I think it would be quite beneficial to expose directive information via introspection API.

From what I can tell, the directive information is the only missing piece of information that is not obtainable via introspection API. For example in this schema definition:

type User {
  id: ID!
  name: String
  admin: Boolean! @important
}

type Query {
  user: User
}

@important directive is only available at schema materialization time, but discarded as soon as schema is materialized from AST definitions.

One can see directives as a way to instruct server to construct the schema is specific way. But I would argue that directives have also a lot of potential as a way to expose additional meta-information about the schema. This may include things like: field cost/complexity (the use case I'm quite interested in), auth information, caching characteristics, and in general any kind of structured meta-information that is specific to a particular application or subset of applications. I think a lot of interesting use-cases and scenarios can emerge as soon as this meta-information is exposed via introspection. I can also imagine community working together on a set of common directives that may be adopted by client tooling and frameworks like apollo and relay. These common directives (exposed via introspection API) may provide deeper insights on the schema, relations between fields and objects, etc.

I described this feature in context of IDL, but in fact it's quite independent from it (though integrates with it very naturally). I was thinking about different ways how this kind of user-defined meta-information can be exposed via introspection API and I feel that directive-based approach is the most natural and well integrated way of doing it.

I would love to hear you opinions on this topic!

Caleb Meredith · Answer 1 · Fri Apr 28 2017 11:05:58 GMT+0800 (China Standard Time)

field cost/complexity (the use case I'm quite interested in)

I had some experiments on this that I didn’t release. I’d love to hear your thoughts 😊

I agree that being able to put arbitrary information into introspection is incredibly powerful, but I don’t think that we should be translating directives one-to-one into the introspection. Directives are meta instructions for the tools which consume the IDL. Making them a first class part of introspection reveals too many implementation details. It would also be very tough to type the directive arguments well.

I’d rather see tools that can interpret directives and then translate that to fields in the introspection 😊. For example, a server could:

extend type __Field {
  important: Boolean
}

…and then no matter where you define your schema whether it be in the GraphQL IDL, GraphQL.js, or some other GraphQL server framework this flag can be set.

I don’t like the idea of making the IDL the one source of truth when creating a GraphQL server, but I do really like the idea of allowing users to extend the introspection types with arbitrary extra information.

Scott · Answer 2 · Fri Apr 28 2017 11:52:05 GMT+0800 (China Standard Time)

I don’t like the idea of making the IDL the one source of truth when creating a GraphQL server

Amen!

Scott

ΘLΞG · Answer 3 · Sun Apr 30 2017 07:28:07 GMT+0800 (China Standard Time)

I don’t think that we should be translating directives one-to-one into the introspection

I agree, the subset of directives that are used in the IDL may be completely different from subset of directives that are exposed via introspection (they may not even overlap)

I don’t like the idea of making the IDL the one source of truth when creating a GraphQL server

100% agree on this one. The whole idea is quite unrelated to IDL schema definition. Though if meta-information is exposed in a directive format, then some interesting scenarios can emerge. For example this end-to-end scenario falls naturally out of it:

"Internal Service 1" may use completely different set of directives at creation time than directives that are exposed via introspection to assemble the "Gateway" IDL. But using a directives is quite convenient since they are easily translated to/from IDL.

IDL aside, directives have an advantage that they are also easily introspectable though the API. But in general don't have a very strong opinion on the actual meta-information format. My main motivation is to somehow expose additional user-defined meta-information though introspection API.

Though I have a concern about the format you proposed:

extend type __Field {
  important: Boolean
}

If it is not defined with IDL, then server implementation somehow need to provide a way for user to define additional meta-fields on all GrapohQL schema-related classes and the type information about these fields somewhere on the schema. I think it can become a bit complex, especially for type-safe languages, also considering that with directives one can already cover the second part (directives already provide the type definitions for this meta-information, so there is no need to introduce a new concept for it)

Scott · Answer 4 · Sun Apr 30 2017 14:55:28 GMT+0800 (China Standard Time)

I think I know where you are heading with this, and I agree wholeheartedly, however, the solution isn't in GraphQL's own metadata injection system. Trying to extend it to cover more business use cases is the wrong direction. Up to this point, I've heard suggestions on authorization, validation, and of course, the data modeling itself (since it is part of GraphQL it is why so many are looking to GraphQL solutions to solve business domain problems).

I am going to go out on a limb here. The way I see it, Facebook has offered us a really cool way to build a gateway into our data. However, I am almost certain, they are only telling a partial story. I am convinced that they are doing metadata based development, where the metadata is the business logic itself, and GraphQL only offers (to those who should see it) access to that particular kind of data. When I see Lee Byron push back on suggestions like this and others, it is sort of dawning on me that Facebook is coming from another world of thinking and IMHO, it can only be metadata driven development.

Why is metadata driven development good? Because it puts the power of any change to business logic in the hands of the business.

In other words, once the metadata is set and known, then getting the business model (the domain model) together, programatically, is a matter of building it from the metadata. Tools can be offered to non-programmers to change the metadata. The same build-from-metadata goes for GraphQL endpoints too. In other words, metadata is the driver, not GraphQL schema. From the metadata, it would be a matter of translation into definitions for GraphQL, protobuffers, etc. The single source of truth is then the one set of metadata.

So, I guess what I am trying to say is, instead of trying to stuff all kinds of metadata inside GraphQL, we should be thinking about how we can let the metadata drive defining GraphQL schema.

Scott

Robert Mosolgo · Answer 5 · Sun Apr 30 2017 20:09:45 GMT+0800 (China Standard Time)

👍 I like the idea, I've had half a mind to implement it in Ruby anyways, since the IDL isn't showing signs of ever leaving RFC stage 😆

Thanks for sharing those thoughts about metadata-driven development. That's something interesting to think about, as the Ruby library grows, the Ruby API for schema definition is becoming more a hindrance than a help.

My thought has been to make the GraphQL schema be the metadata. Otherwise I have to invent yet another metadata structure which maps to GraphQL 😖

Robert Mosolgo · Answer 6 · Sun Apr 30 2017 20:11:03 GMT+0800 (China Standard Time)

I worried about portability, since different schema parsers might handle these inputs differently, but I thought I could just include a function to parse a schema then dump it without the custom directives.

Scott · Answer 7 · Sun Apr 30 2017 22:15:46 GMT+0800 (China Standard Time)

the Ruby API for schema definition is becoming more a hindrance than a help.

Yeah, it seems many people would like to turn their GraphQL system into a "God API", whereas it clearly should only be a relatively simple, but totally awesome gateway into the business logic layer of the application.

My thought has been to make the GraphQL schema be the metadata. Otherwise I have to invent yet another metadata structure which maps to GraphQL.

Yes, but the metadata can be the source of truth for the whole application (or applications), including the API. Think about validation, authorization, workflow, models, and whatever else is business driven. And, your answer tells me you are also still thinking in the wrong direction. The GraphQL API would be modeled after the metadata, not the other way around. 😄

Loopback does something similar to what I am talking about with its "API modeling" according to the modeled data.

Scott

ΘLΞG · Answer 8 · Sun Apr 30 2017 22:54:58 GMT+0800 (China Standard Time)

@smolinari you brought ups some very interesting points. Though my original intention was more about exposing additional information, rather then a way to aid the data modeling. I would definitely agree, directives indeed expose domain concerns. Even if we generate GraphQL schema based on some other data modeling tool, I think it's still very helpful to be able expose some meta-information via introspection API. Let's stick to this example with a gateway. Recently there was a great video by @bryan-coursera from Coursera on this topic. in particular, I found "Linking the resources" part quite interesting:

https://youtu.be/r-SmpTaqpDM?t=1479

If I understood it correctly, their internal services expose additional meta-information about relations between different models. I think directives can be quite useful in this respect for assembler/gateway service. For example schema of 2 internal services can look like this (I used IDL for demonstration, but it would be accessed via introspection in the actual app):

# "courses" service

type Course {
  id: ID!
  name: String
  subscriber: [ID] @reference(service: "users", rootField: "users", type: "User")
}

# "users" service

type Query {
  users(ids: [ID]): [User]
}

Gateway service then will discover these schemas via introspection API and expose Course type like this (with knowledge on how to resolve it correctly and efficiently using 2 other services):

# "gateway" service

type Course {
  id: ID!
  name: String
  subscriber: [User]
}

When it comes to data modeling, I think GraphQL IDL syntax can be a very good candidate for it. Over the years I saw quite a few tools and formats to declaratively model the data and the domain. Though looks like there is no tool that have seen very wide widespread. I feel that MDD (Model-Driven Development) has it's own set of challenges. I saw it taken quite a bit too far (in quite big production setups) where it becomes real pain to deal with (instead working application code, people a writing generator/tool code which adds additional layers of indirection and frustration). I feel that declarative data modeling fits quite well where the domain itself is very well established and understood.

Recently I saw several examples where GraphQL IDL is used in vary interesting data modeling scenarios. First example is graphql-up. Given this IDL type definition:

type User {
  id: ID!
  name: String
}

It will create a GraphQL schema that will contain User input/output types, relay API to read users and create/update new one, etc. So the IDL that you provide to graphql-up and a GraphQL schema that you end up with are very different. Using GraphQL IDL syntax to model the data in this case (actually any other syntax/language will do the trick in this scenario) has quite a few advantages:

There is already huge amount of tooling available for GraphQL, so it's easy to work with it (especially pragmatically), visualize it and do other interesting things to it
The syntax is familiar and well established, so the learning curve is much shorter, especially considering how nicely it correlates with the end result

Another very interesting adoption of GraphQL IDL syntax is contraband (Contraband is a description language for your datatypes and APIs, currently targeting Java and Scala. It would be part of the next version of scala build tool). As you can see, they adopted the IDL syntax, but changed it in a few interesting ways (including introduction of namespaces, YAY :)).

I see these two examples as a good validation of an idea that GraphQL IDL can be a viable tool for data modeling.

Scott · Answer 9 · Mon May 01 2017 11:41:09 GMT+0800 (China Standard Time)

Though my original intention was more about exposing additional information, rather then a way to aid the data modeling.

I understand. My intention also isn't really about aiding data modelling, but rather automatic generation of the API from a set of metadata. If you have that kind control over the metadata, and the metadata is also persisted in some manner, you can also control as much or little introspection of any of the "view" of any data you want. I realize this is getting quite esoteric, but try to think inside-out or rather, think that the API is something far, far away from a single source of truth. The API should be a window into the application's business layer in that it is only modelled after the domain models, which are (must be) defined elsewhere in the application. I am not saying this translation of metadata is easier, but overall, it is a lot easier than bending the API to all our business needs.

Right now, GraphQL is so cool and allows for so much, it is so flexible, people are starting to want to "model" everything in it, including the logic of what users can introspect. 😉 Whereas, these decisions of what to see or not, (no matter what is being controlled) is basically authorization logic and that is 100% business logic. Thus, it has or should have nothing to do with the internal workings of the API, except that there could be models burnt in metadata for the authorization too, which can also be generated as GraphQL schema, which can be made introspective ( or not, since we'll hopefully be able to generate schema/ the API automatically).

My simple and hard to fathom point is, the single source of truth cannot be the API/ the schema itself. It should only be fashioned after the applications single source of truth, and that is the business/ domain logic.

I know I have butted into similar discussions in other places about this. I might be getting on people's nerves because of it (who are also definitely loving GraphQL and its scene/ community). So, I think I've clarified my point as best I can here. I'll bow out now and let the conversation continue. Just let me warn everyone that making the API "too smart" is dumb and unnecessary. The hard work needs to go somewhere else in the depths of the server stack, which in the end, will make working with GraphQL overall, much easier. 😄

Scott

ΘLΞG · Answer 10 · Wed May 03 2017 06:52:48 GMT+0800 (China Standard Time)

@smolinari thanks a lot for a very detailed explanation! I think I can now better understand your point. I would definitely agree with it, there is much more to business logic of an application than what API can/should expose. I think it's also a good point to keep in mind as discussion around this issue progresses.

Greg Hurrell · Answer 11 · Wed May 03 2017 08:30:03 GMT+0800 (China Standard Time)

Interesting discussion. Thanks for starting it @OlegIlyenko. As you know, the role of directives as currently defined in the spec is pretty narrow; they are intended to:

[D]escribe alternate runtime execution and type validation behavior in a GraphQL document.

Exposing them via introspection (beyond __schema { directives { ... } }) would be a pretty large extension which we would want to evaluate carefully. My initial instinct is that exposing them like this would be overloading their purpose in a way that would increase the conceptual burden in an undesirable way, and I'd like to see some more exploration of specific use cases where having schema directives exposed via introspection would make things that are currently very difficult (or impossible) to do via other means significantly easier (or possible).

@OlegIlyenko: for example, you mentioned "field cost/complexity". Can you tell us more about that? We've certainly built tooling around that internally at FB, but it exists outside the schema (consuming the schema, developer queries/fragments, and runtime metrics as inputs).

Ivan Goncharov · Answer 12 · Wed May 03 2017 22:13:14 GMT+0800 (China Standard Time)

Expose IDL directive information via introspection API

@OlegIlyenko IMHO, IDL word in the title makes people think that the only way to expose this meta-information will be defining it inside IDL document. But nothing prevents you from specifying applied directives if you define the schema in the source code (with support from the server-side lib). So how about renaming it to:

Expose values of applied directives via introspection API

or something similar?

My initial instinct is that exposing them like this would be overloading their purpose in a way that would increase the conceptual burden in an undesirable way

@wincent I think it's a good solution to spec bloat. For example, according to the graphql-js implementation, you can deprecate field by using @deprecated directive, but in introspection, it is exposed through isDeprecated and deprecationReason fields. That means if I decide to have something like @deprecationDate I am forced to define new fields inside introspection, e.g. deprecationDate. The only way to safely achieve this will be pushing such directives and fields into the spec and this will lead to spec bloat.

To sum it up: GraphQL introspection should support mechanism for vendor extensions inside introspection and exposing applied directive values is a good solution for that.

I'd like to see some more exploration of specific use cases where having schema directives exposed via introspection would make things that are currently very difficult (or impossible) to do via other means significantly easier (or possible).

Here are a few examples from the top of my head:

@localizeName for enum values. I like that spec is limiting such names to ASCII but at the same time, there should be a possibility to specify localized name and use them on the client.
@relayMaxSliceSize which specify maximum number you can pass to first/last. It will allow implementing zero-config pagination
@examples for field arguments which can be used to generate better documentation (e.g. show them somewhere in graphiql when you type field arguments)

Caleb Meredith · Answer 13 · Thu May 04 2017 03:32:16 GMT+0800 (China Standard Time)

@OlegIlyenko have you considered introducing only a single directive in the IDL that maps well to introspection that would allow users to provide metadata? Something like @metadata. Users could then define (or extend) a __FieldMetadata type, or __FieldMetadata could be a scalar which accepts any JSON object. This could be represented in the IDL as:

type __FieldMetadata { important: Boolean }
# Or...
scalar __FieldMetadata

# We may also have a `__TypeMetadata` perhaps.
directive @metadata(field: __FieldMetadata)

type User {
  id: ID!
  name: String
  admin: Boolean! @metadata(field: { important: true })
}

(I may be getting the directive syntax wrong, feel free to edit this comment if it is wrong)

Or in the introspection query this would be modeled as:

{
  __type(name: "User") {
    fields {
      metadata { important }
    }
  }
}

This balances the need for attaching metadata to a GraphQL schema with the desire to not introducing special behavior around all directives in the IDL.

ΘLΞG · Answer 14 · Thu May 04 2017 06:10:55 GMT+0800 (China Standard Time)

@wincent

would be a pretty large extension which we would want to evaluate carefully

I definitely agree with this! Seeing all these great comments made me think a lot about the concept and it's soundness :) Now I discovered some new interesting perspectives on it.

you mentioned "field cost/complexity". Can you tell us more about that?

assuming that complexity calculation is a simple and static algorithm (like the one I used), it can be replicated in a client-side tooling given that the information about complexity of individual fields is available in some way (ideally though the introspection API).

This feature saved us already several times from unintentional expensive client queries. But when we start a dialog about why query was rejected by server and what query complexity/cost means, people always get confused since from a client perspective it's hard to predict (at least in more non-trivial cases) the overall complexity of the query in advance without communicating to the server (and then tweak it in order to ensure that complexity is below the threshold). I believe that by making this information more transparent we can avoid a lot of confusion around complexity estimation and help developers to write efficient queries. If this information is available though the introspection API, then the complexity calculation can be implemented as query validation rule which then can be used by existing linting tools (no modification is necessary to the tool itself). If we take this idea even further, one can develop a GraphiQL plugin that shows complexity of individual fields and field + nested selection set on mouseover. I think these kind of insights will be very helpful to client and server developers.

this would be overloading their purpose

I also share this concern. I think directives are convenient since after this change it would very easy to fully represent an introspection data in IDL syntax. I'm open to different syntax/encoding of this information. My main goal in this particular issue is to prove/disprove my hypothesis that it is useful/beneficial to expose user-defined meta-information via introspection API and benefits are worth added complexity. I just thought that it would be helpful to have some concrete syntax in examples.

@IvanGoncharov

It's an excellent point about deprecation! I haven't thought about it in this way, but now that you mentioned it, it makes perfect sense. Also if we want to, for instance, add a deprecation feature on other things, we can just update the directive itself without any changes to the introspection API. E.g.:

- directive @deprecated(reason: String) on FIELD_DEFINITION | ENUM_VALUE
+ directive @deprecated(reason: String) on FIELD_DEFINITION | ENUM_VALUE | OBJECT

I also like your other examples. I think they all are valid use-cases. Totally agree about the title, I think it caused quite a bit of confusion. I updated it to better reflect the original motivation.

@calebmer

I think it is an interesting idea and definitely worth considering. Though I personally would prefer not to mix disjointed concerns in a single type. With this approach we can end up with type like this one:

type __FieldMetadata {
  localizedName: LocalizedString
  complexity: Float
  example: String
}

I would rather prefer to see these as independent entities (like with the directives). This will also require introduction of 11 new types (__FieldMetadata, __EnumMetadata, __EnumValueMetadata, __ScalarMetadata, etc.).

Caleb Meredith · Answer 15 · Sat May 06 2017 00:00:06 GMT+0800 (China Standard Time)

@OlegIlyenko why would you not want to mix disjointed concerns in a single type? There are many ways to design the type to make mixing less confusing. Also, how is the example you gave for __FieldMetadata fundamentally different from using directives?

Also, if you think 11 new types is a bad thing (I don’t necessarily think so) then the specification could make the metadata types optional. We could also combine all metadata into one type: __Metadata.

The point is I agree that the ability to expose arbitrary metadata in both the JSON and IDL introspection formats is incredibly useful, but overloading directives may not be the right solution 😉. Is there some other directive-like syntax that could accomplish the same thing?

Ivan Goncharov · Answer 16 · Tue May 09 2017 00:10:51 GMT+0800 (China Standard Time)

why would you not want to mix disjointed concerns in a single type?

@calebmer Because you can't easily reuse different types.

In your scenario, a user needs to explicitly define __Metadata type with all fields in it and maintain it in sync so it provides fewer incentives for reusing existing metadata conventions.

On the other hand, let's take two directives from my previous post: @localizeName and @relayMaxSliceSize. You just need to append directives definition to the schema either in form of IDL or GraphQLDirective objects. Moreover, we can write a tool that detects directive usage and append appropriate definition automatically.

My main requirement for "Expose user-defined meta-information via introspection API" is to allow for flexibility but at the same time encourage people to reuse conventions.

Also, one technical issue with __Metadata type: It makes impossible to get introspection via static query since you don't know its fields in advance. So you first have to make query __type(name: "__Metadata") and only then form dynamic queries with all fields.

Here are additional arguments for using directives to expose metadata to the client:

Directive definitions are already exposed through introspection
directives can be tied to specific location
if you already use a directive to alternate runtime execution and type validation you can expose them to the client side. For example, if you have server-side validation you can use the same directives to power client side validation. So directives are the only way to configure server and client at the same time without duplication.

ΘLΞG · Answer 17 · Tue May 09 2017 01:26:07 GMT+0800 (China Standard Time)

@calebmer I definitely think __Metadata should be considered a valid alternative. Though I tend to agree with @IvanGoncharov's arguments. So it has it's own set of advantages and disadvantages, like any other approach. I guess it will boil down to a question which tradeoffs we a willing to take.

I also played with other ideas for a syntax. Maybe placement of a directive may decide whether it is exposed via introspection or not (usage side):

type User {
  id: ID!
  name: String
  
  @deprecated
  admin: Boolean! @important
}

Or allow directive to be exposed at a definition side with a modifier keyword like public or exposed:

exposed directive @deprecated(reason: String) on FIELD_DEFINITION | ENUM_VALUE

Another idea is to introduce new concept, like annotations. Syntactically it would similar to directives, but will provide better identification that these 2 things are meant for different purposes. Though I don't really like this idea that much, it adds too much complexity.

@wincent I was thinking about the directive spec definition for a while now:

[D]escribe alternate runtime execution and type validation behavior in a GraphQL document.

I would argue that @deprecated directive already deviates from this definition. Although it influences how schema is generated and can be used to validate a query against the schema, it's main purpose is to expose additional structured information about a field or enum value definition.

I guess it is just different perspective on looking at the same thing. I would rather define a directive as a way to provide additional structured information on different GraphQL entities. Server and clinet runtime then can take advantage of this information in different interesting ways (not only in terms of query execution or validation. These two are just valid use-cases). In fact, this is what spec defines as well:

Directives can be used to describe additional information for types, fields, fragments and operations.

So I feel that using directives in this context does not violate the spec definition.

Felix Becker · Answer 18 · Wed Nov 01 2017 02:21:50 GMT+0800 (China Standard Time)

I think it would be awesome to have this because it would allow the community to experiment with solutions to unsolved problems in GraphQL before/without putting it into the spec. For example, we could try out a directive to annotate what errors a field can cause, and then codegen tools can use that information.

Garen Torikian · Answer 19 · Tue Nov 21 2017 10:22:12 GMT+0800 (China Standard Time)

So I feel that using directives in this context does not violate the spec definition.

~~The annoying bit of this is that directives, at this time, cannot be applied to arguments, which means they cannot be given metadata in this way.~~ Not true! The IDL spec says yes, but I think this information is missing from the GraphQL spec.

Yordis Prieto · Answer 20 · Thu Nov 23 2017 21:58:19 GMT+0800 (China Standard Time)

Related to this is #376, where I basically need some soft of tagging for the mutations like @group('order') createGroup : MutationResponse so GraphiQL tool could do some grouping in the Doc section

Jimmy Jia · Answer 21 · Thu Dec 21 2017 03:19:11 GMT+0800 (China Standard Time)

I'd love to have this as well. While I understand that, per graphql/graphql-js#746 (comment), schema directives just seem like a very natural way to attach this sort of metadata to fields.

Actually, though this was a misreading on my part, I actually found it somewhat surprising that things didn't already work this way.

It seems like the sort of thing that should "just work".

Bojan Tomić · Answer 22 · Mon May 28 2018 06:32:01 GMT+0800 (China Standard Time)

It is important to not just blindly expose the existing directives through introspection as that would suddenly make them unfit for storing anything sensitive/internal, like security roles, permissions etc, which seems to be a common use-case in the wild.

Of course, there's various suggestions listed here that would work just fine. My intention was only to clearly state a concern.

Jimmy Jia · Answer 23 · Mon May 28 2018 07:22:17 GMT+0800 (China Standard Time)

Yup, makes total sense. I think it wasn't totally obvious to those of us coming from the side of using programmatically constructed schemas that people used directives for that purpose via the SDL. Some different syntax or a special carve-out is a must.

Felix Becker · Answer 24 · Tue May 29 2018 01:42:06 GMT+0800 (China Standard Time)

What do you mean by “existing directives”? There is only @deprecated atm

Bojan Tomić · Answer 25 · Tue May 29 2018 02:40:24 GMT+0800 (China Standard Time)

One is allowed to invent and use any directive they want. And a common use-case is a directive for authorization, akin to @auth(role:’manager’). Here's an example, and another one.
Simply exposing any directives present in the schema is hence rather dangerous.

Jimmy Jia · Answer 26 · Tue May 29 2018 02:47:15 GMT+0800 (China Standard Time)

Does anyone own this proposal, incidentally? The options I've seen floating around have looked like:

exposed directive @foo(...) on ...

decorator +foo(...) on ...

Or have a special @meta directive that is exposed...

And possibly with @deprecated getting merged into one of the above.

@xuorig I see that per these meeting notes https://github.com/graphql/graphql-wg/blob/27d27dbe7884c8c54798b3812b1076f3e7cde253/notes/2018-02-01.md#exposing-schema-metadata that you were looking at this. Did that lead to a concrete proposal?

Eric Dalquist · Answer 27 · Mon Jun 18 2018 13:01:38 GMT+0800 (China Standard Time)

Just another voice supporting the ability to customize the introspective types of the schema. I've played around a bit and personally and it seems like the most flexible approach.

Robert Mosolgo · Answer 28 · Mon Jun 18 2018 20:52:50 GMT+0800 (China Standard Time)

FWIW I added the option in GraphQL-Ruby to extend the introspection types: http://graphql-ruby.org/schema/introspection.html#customizing-introspection

But as far as I know, nobody has done it yet :P

Eric Dalquist · Answer 29 · Mon Jun 18 2018 22:13:53 GMT+0800 (China Standard Time)

That is very similar to a local patch to for the Java GraphQL library I have. It re-writes the Introspection class to contain a bunch of static methods that provide Builder instances. These are then made available during Schema building and there is an extension to Schema that provides references to the compiled __type and __schema meta fields as they exist for that specific Schema instance.

It is a fairly minimal change but then allows users of the graphql-java library to customize the schema introspection data as they see fit. Is this something I should look at sending over to graphql-java as a pull request?

Jack Timblin · Answer 30 · Mon Aug 06 2018 20:57:35 GMT+0800 (China Standard Time)

I think even a list of the names of the directives attached to the element would be helpful so it would be possible to query the schema for more detailed information about that directive. So the schema:

directive @important on FIELD_DEFINITION

type User {
  id: ID!
  name: String
  admin: Boolean! @important
}

type Query {
  user: User
}

We could update the definition of __Field on include a list of names of the directives attached to that field.

extend type __Field {
  directives: [__Directive!]
}

query {
    __schema {
         types {
             fields {
                  name
                  kind
                  directives {
                        name, args
                  }
             }
         }
    }
}

komkanit · Answer 31 · Mon Jan 14 2019 19:18:49 GMT+0800 (China Standard Time)

@jacklaaa89 I cannot extend type __Field
do you have any code example?

Benjie · Answer 32 · Mon Jan 14 2019 19:27:26 GMT+0800 (China Standard Time)

@komcal This is a proposal for the GraphQL specification itself. Types, fields, etc that start with a double underscore (__) are reserved for the GraphQL introspection system and can not / should not be modified in a user schema: https://facebook.github.io/graphql/draft/#sec-Reserved-Names

Bojan Tomić · Answer 33 · Thu Jan 24 2019 04:14:14 GMT+0800 (China Standard Time)

@edalquist Please do contact the graphql-java team about your idea.

Inaki Anduaga · Answer 34 · Fri Feb 15 2019 07:15:14 GMT+0800 (China Standard Time)

I just run into this thread while working on a schema stitcher "gateway" API that stitches different APIs and has to handle authorization. So far these API's were handling authorization / authentication on their own, however we want to move that layer to the stitcher in front of them. To do so without maintaining internal knowledge of the APIs in the stitcher, we need to be able to attach information to each field in the upstream APIs so we know what permission scope is required to access each field. Directives would be the perfect way to do this, alas the information is lost when introspecting. I don't know if we'll find a workaround, but basically agree that there should be a way to attach metadata to fields, and directives are already doing that, if only they could be exposable

Bojan Tomić · Answer 35 · Fri Feb 15 2019 08:21:21 GMT+0800 (China Standard Time)

@inakianduaga How would you prevent any random client from reading your authorization rules?

Inaki Anduaga · Answer 36 · Fri Feb 15 2019 16:29:06 GMT+0800 (China Standard Time)

How would you prevent any random client from reading your authorization rules?

Client -> GraphQL stitcher     -> Upstream API 1
                               -> Upstream API 2
                               -> ...

Clients only talk to the stitchers. Each upstream API can tag whatever nodes it wants from its schema with a scope value. The stitcher itself has the information about what permissions the user is allowed (via JWT or whatever means) and applies this restrictions programatically on the stitched schema without needing to know any details about the upstream APIs or synchronise any definitions.

This allows you to do better than simple authorisation, since you can completely hide the nodes a client is not allowed to see even when they perform an introspection. That way for the clients it's WYSIWYG, meaning everything they see on the schema is requestable, and they can't see what they are not allowed to request.

IF directive information where exposable, like this ticket wants, implementation would be straightforward

Benjie · Answer 37 · Fri Feb 15 2019 16:32:17 GMT+0800 (China Standard Time)

I think in @inakianduaga’s system access to the APIs behind the gateway would be blocked, so the information would not leak - when stitching these auth hints would be “consumed”.

An alternative approach could be to add an explicit “authMeta” root-level field that contains the auth information, and just ensure this is dropped while stitching. It’s definitely preferable to locate this information local to each GraphQL type though!

Bojan Tomić · Answer 38 · Fri Feb 15 2019 16:39:25 GMT+0800 (China Standard Time)

@inakianduaga I understand for your specific example, but I was thinking of the general case. Wouldn't it become necessary to always have a wrapper schema (to prevent security info leakage) if directives were introspectable?

Benjie · Answer 39 · Fri Feb 15 2019 16:44:06 GMT+0800 (China Standard Time)

I think the directives would have to add themselves to introspection. By default directives should not be exposed (so as to respect backwards compatibility). Really what we’re talking about here is extending the introspection types and adding additional fields containing custom metadata; directives are just a convenient pre-existing way to express these extensions via SDL.

Christian Kjær · Answer 40 · Fri Feb 15 2019 16:48:11 GMT+0800 (China Standard Time)

@inakianduaga How would you prevent any random client from reading your authorization rules?

I definitely understand your concern, but it doesn't matter, for us, if the client knows which resources are protected and which are not.

For example, we tag our queries and mutations with a custom @iam(key: "Stops.Create") directive. The user can do absolutely nothing with this knowledge, but we are able to enforce it various places. In fact, we want this information to be available for the developers introspecting the schema, so they know what things they need to check if the user has a feature for or not.

If you consider this leakage of information (security levels), then HTTP 403 is equally leaking, since it also tells you that something exists, but requires additional permissions.

Now, if you are putting confidential information into your directives, then yes, that is a problem.

By default directives should not be exposed (so as to respect backwards compatibility). Really what we’re talking about here is extending the introspection types and adding additional fields containing custom metadata; directives are just a convenient pre-existing way to express these extensions via SDL.

As @benjie suggests, I also think it should be an explicit action to expose a directive. E.g. in the server config, specify the directives that are exposed. This would alleviate the concerns about leakage, since it's now an explicit actions.

As an example, we are currently patching graphql-js in our own fork to enable us to use directives for IAM in our stitched schema.

Paul Damnhorns · Answer 41 · Fri Feb 15 2019 16:53:44 GMT+0800 (China Standard Time)

@Tehnix can you remove build files from your path? 17k lines in your diff 🤪

Bojan Tomić · Answer 42 · Fri Feb 15 2019 16:58:22 GMT+0800 (China Standard Time)

Certain implementations , e.g. graphql-java, enable you to dynamically decide what fields and even arguments are visible, both to introspection and in general. If this capability was to be standardized and extended to directives, all security related directives could themselves have access rules applied to their own visibility.

This way schemas that are never exposed directly (like the upstream schemas in your example) can have no access rules, while client-accessible schema can utilize them to prevent security related directives (or any other schema element for that matter) from being visible to introspection, and generally.

These two (introspectable directives, and dynamically controlled visibility of all elements) put together sounds like the combo that can cover any use-case.

Christian Kjær · Answer 43 · Fri Feb 15 2019 16:59:20 GMT+0800 (China Standard Time)

@Tehnix can you remove build files from your path? 17k lines in your diff 🤪

@nodkz We're patching it in yarn post-install where we pull in the library, so we need to patch the whole dist as well :) That is, we have the regular graphql-js at a matching version in our dependencies, and then we apply our patch afterwards, meaning we work with the build files.

If you only want the source patch, you can generate it with e.g. (notice that it now looks at src/*, whereas our own patch looks at dist/*),

$ git diff b14.0.0-rc.2 master src/* > add-iam-directive-14.0.0-rc.2.patch

Benjie · Answer 44 · Fri Feb 15 2019 17:11:06 GMT+0800 (China Standard Time)

(ASIDE: You can mark the files as generated in .gitattributes and then GitHub will hide their contents by default.

dist/** linguist-generated=true

https://help.github.com/articles/customizing-how-changed-files-appear-on-github/
)

Victor Andrée · Answer 45 · Sun Jun 23 2019 17:19:50 GMT+0800 (China Standard Time)

As far as the GraphQL specification is concerned, wouldn't it be sufficient to allow extending the Schema Introspection types, either with arbitrary fields or under a "safe" extensions field? Allowing a field extensions would be in line with how GraphQL allows custom fields on errors and the response map (this was noted by @IvanGoncharov in #543 (comment)). The Schema Introspection section could be amended to simply reserve a field named extensions:

Any type of the GraphQL schema introspection system can provide a field with name extensions. This field is reserved for implementors to extend the introspection schema however they see fit.

As @IvanGoncharov noted in the previously mentioned #543 (comment), "two-stage introspection" could be used to introspect what can be introspected on a target schema.

A concrete example would be to support the directives used in Apollo's GraphQL Federation, by adding the relevant metadata to __Type (for @keys and @extends) and __Field (for @external, @requires and @provides). This issue is under discussion under https://github.com/apollographql/apollo-server/issues/2769

scalar _FieldSet

type _TypeExtensions {
  keyFields: _FieldSet
  isExtension: Boolean!
}

extend type __Type {
  extensions: _TypeExtensions
}

type _FieldExtensions {
  isExternal: Boolean!
  requiresFields: _FieldSet
  providesFields: _FieldSet
}

extend type __Field {
  extensions: _FieldExtensions
}

If you require this metadata, you can do an initial introspection to see if the target supports it:

{
  Type: __type(name: "__Type") {
    fields {
      name
      type {
        kind
        name
      }
    }
  }

  Field: __type(name: "__Field") {
    fields {
      name
      type {
        kind
        name
      }
    }
  }
}

Once you know that your introspection schema supports the relevant fields, you run an extended introspection query.

{
  __schema {
    types {
      name
      extensions {
        keyFields
        isExtensions
      }

      fields {
        extensions {
          isExternal
          requiresFields
          providesFields
        }
      }
    }
  }
}

For those who want to expose "all directives" or arbitrary metadata, simply extend the introspection schema to support it.

Benjie · Answer 46 · Mon Jun 24 2019 00:24:56 GMT+0800 (China Standard Time)

I really like this idea - it's simple and powerful. I think the _TypeExtensions and _FieldExtensions would be arbitrarily-named user-defined types (hence no __ prefix, which agrees with what @victorandree has written) which would be supplied to the GraphQL schema itself (via the schema keyword in SDL, e.g. schema { query: Query, typeExtensions: _TypeExtensions, fieldExtension: _FieldExtensions }, or via the GraphQLSchema constructor in GraphQL.js).

Victor Andrée · Answer 47 · Mon Jun 24 2019 01:36:23 GMT+0800 (China Standard Time)

Using user-defined types for introspection schema extensions has two downsides, compared to allowing extend type __Type directly, however:

You'd actually have to introspect __Type to figure out what the user-defined extensions type is called, then introspect it to figure out what the extensions are, then do the extended introspection on the types themselves.
If you have multiple extensions from different sources – say one for authentication and one for federation – it's not obvious how you'd merge the different types under one field. However, both sources would know to extend type __Type since it's well-known.

This could be managed by every introspection schema type having an extensions field with a well-known name (e.g. __Type always has extensions: __TypeExtensions); or providing some other guarantee that additional fields on __Type et cetera wouldn't clash with a future GraphQL introspection schema (user-defined field must have some prefix, for example).

Added 2019-06-24: Object types must define one or more fields to be valid (see point 1 under Type validation: "An Object type must define one or more fields."), so the spec providing type __TypeExtensions without any fields would not be allowed under the current spec (but see #568).

Vladimir Alexiev · Answer 48 · Wed Jul 17 2019 21:42:13 GMT+0800 (China Standard Time)

Here's another simple use case.

For each field and object in a GraphQL schema, we want to split off label and descr so we can use the label as field label in some UI, and show the descr in a tooltip.

We've modeled it like this:

directive @descr(_:String!) on FIELD_DEFINITION | OBJECT
"ID" x_id: String @descr(_:"Identifier in source dataset. Single-value, optional")

However, we can't get @descr using introspection because __Field doesn't include directives. For this example it returns only this:

"name": "x_id",
"description": "ID"

Despite the discussion above that directives may be internal details of a server, I find this strange because __Directive definitions are included in introspection.

Benjamin Rood · Answer 49 · Wed Apr 15 2020 13:53:11 GMT+0800 (China Standard Time)

However, we can't get @descr using introspection because __Field doesn't include directives. For this example it returns only this:

@VladimirAlexiev

I've had to do something similar recently. Using graphql-tools process' of extending VisitSchemaDirective, and implementing the appropriate method (depending on what type of *_DEFINITION the directive is applied to), I injected the name of the field the directive was on and the value of any arguments into the GraphQLResolveInfo object in the appropriate shape.

Vladimir Alexiev · Answer 50 · Tue Apr 21 2020 19:41:42 GMT+0800 (China Standard Time)

@benjamin-rood We did a similar extension in the Ontotext Platform: http://platform.ontotext.com/tutorials/graphql-introspection.html
Eg in "what fields are available for an object", the standard introspection query

{
  __type(name: "Human") {
    name
    fields {
      name
      type {
        name
        kind
      }
    }
  }
}

returns eg this, although the "directives" payload was not called for:

      "fields": [
        {
          "name": "id",
          "type": {
            "name": null,
            "kind": "NON_NULL"
          },
          "directives": {
            "@descr": {
              "_": "Single, mandatory. Each RDF node has exactly one IRI."
            }
          }
        },

Ivan Maximov · Answer 51 · Wed Apr 22 2020 00:35:18 GMT+0800 (China Standard Time)

By the way, non standard introspection extensions can break ui tools like GraphQL Playground, GraphiQL, Voyager, etc. I was unpleasantly surprised when Voyager could not open due to my directives extension - graphql-dotnet/graphql-dotnet#1451.

Mark Fulton · Answer 52 · Wed May 27 2020 20:16:12 GMT+0800 (China Standard Time)

I would like to mark Fields as PII so that whenever a client uses a field they can check its metadata to see if it is PII (so that sensitive information can be hidden in screenshots and recording tools like Full story, LogRocket, etc.). I was thinking a directive would be a good solution for this but from what I can tell directives don't get serialized into the schema nor can they be introspected by the client. How can a client get metadata on types/fields/etc.?

Benjie · Answer 53 · Wed May 27 2020 20:52:57 GMT+0800 (China Standard Time)

@mfulton26 This proposal should enable this kind of thing in future; for now you've got a couple options, one is to use custom scalars (such as PIIString) to denote fields as PII, or you could find a way to add a "tag" to the description of a field - it's meant to be parsed as Markdown, so an HTML comment such as  in the description text may work. Hope this helps :)

Wojciech Trocki · Answer 54 · Wed Jun 03 2020 21:35:04 GMT+0800 (China Standard Time)

For anyone looking into this from the Node.js perspective (as CLI tool or server) I would recommend to check: https://github.com/aerogear/graphql-metadata

It kinda gives ability to define metadata on schema without any changes in the graphql parser etc. (it basically builds it's own local parser over the schema

Marc-Andre Giroux · Answer 55 · Fri Oct 23 2020 04:01:16 GMT+0800 (China Standard Time)

It seems like this issue hasn't explored the option of creating a 1-1 mapping between SDL schema directives and introspection directives very seriously 👇

{
  "data": {
    "__type": {
      "name": "Issue",
      "description": "An Issue is a place to discuss ideas, enhancements, tasks, and bugs for a project.",
      "directives": [{}, {}, {}]
  }
}

Schema directives are now fully in the specification, allowing schema providers to annotate various schema members, including types, fields, etc. I still haven't heard a convincing argument as to why they have to be treated differently from anything else in the schema when it comes to introspection.

The fear I hear the most is that frameworks using the SDL as a mean to construct a GraphQL server may leak directives that are meant for internal usage to the public. I think this concern can be dealt with at the framework/server level, by filtering anything not suited for client consumption out of the final schema. For examples, schema-first frameworks could have a strict allowList of directives that would make sure no directive is ever leaked implicitly.

To me, the SDL represents the actual schema, the contract between a client and a server, and directives are part of that. Personally I'd rather avoid using schema directives for internal behavior and configuration, but I understand it can be an attractive option. I still think the responsibility is then on the schema provider to filter those out from the final schema contract.

I may be missing another blocker here, but I want to bring back what I think is a much simpler, consistent, and symmetric approach to dealing with directives + introspection. Any thoughts?

Ivan Maximov · Answer 56 · Fri Oct 23 2020 04:27:41 GMT+0800 (China Standard Time)

The fear I hear...

No fear :) just a concern of extending spec when something always might go wrong.

I think this concern can be dealt with at the framework/server level, by filtering anything not suited for client consumption out of the final schema. For examples, schema-first frameworks could have a strict allowList of directives that would make sure no directive is ever leaked implicitly.

Exactly this was done in graphql-dotnet/graphql-dotnet#1451

To me, the SDL represents the actual schema, the contract between a client and a server, and directives are part of that.

I think the problem boils down to the fact that the directives were originally conceived as purely an internal part of the server. But time and experience have shown that directives are actually a great tool for clients as part of a server contract.

Somewhere in this thread or in graphql-dotnet/graphql-dotnet#1451, I already wrote that we have been using directives in introspection in production for a very long time. Yes, this required some modification of the GraphQL.NET package, we are using our fork. So far no problems! The only problem I noticed was that tools like the GraphQL Playground were not ready for unknown new fields in introspection. The tool refuses to work properly when it sees unknown fields. This is a famous vicious circle of what should be done first - change the specification or implementation. But in principle, there is an answer to this question (or a recommendation) that first you should make support for a new feature in the implementation (preferably in the reference library, graphql-js), and then approve the feature in the specification. After that, other implementations can relatively easily repeat the algorithm.

In our case, we provide information through directives about the necessary roles that the client must have in order to request certain fields. This is a cross-cutting entity that would require schema modification if being implemented in a different form.

@benjie @spawnia @IvanGoncharov I just want to ask if you plan to work on this issue in the coming years (not months)?

Benjie · Answer 57 · Fri Oct 23 2020 17:10:08 GMT+0800 (China Standard Time)

I'd love this feature to exist, and I always imagined it being exposed through SDL as directives no matter what the underlying introspection JSON results are; just like how deprecationReason is the introspection field, but that's exposed as @deprecated(reason: "...").

As for planning to work on it... I can't commit to anything right now (years or months); I've taken on far too many responsibilities and need to see some of those through before I can think about other things.

Benedikt Franke · Answer 58 · Mon Oct 26 2020 20:37:03 GMT+0800 (China Standard Time)

Some of the discussion around this issue has been shoeboxed by assuming a 1-1 relationship between directives in the SDL and directives in the introspection result. Those are really two separate things, given that GraphQL servers do not necessarly use SDL. How about we just call them user-defined introspection fields?

I see how symmetry with directives is useful for servers that use the SDL. For that reason, we should shape the introspection in a way that allows to represent the contents of directives, including repeatable directives. The actual source of the extra metadata may or not be directives.

Marc-Andre Giroux · Answer 59 · Mon Oct 26 2020 21:13:04 GMT+0800 (China Standard Time)

assuming a 1-1 relationship between directives in the SDL and directives in the introspection result. Those are really two separate things, given that GraphQL servers do not necessarly use SDL.

Why do you think they're two separate things? A field in the SDL is a field in the introspection result. These are GraphQL core concepts. GraphQL servers don't have to use SDL for directives to exist, they are in the specification 🤔

Benjie · Answer 60 · Mon Oct 26 2020 22:38:59 GMT+0800 (China Standard Time)

Everything that's representable via introspection should be representable via the GraphQL IDL and vice-versa. GraphQL tooling such as linting, change detection, etc often uses the IDL as the interchange format, rather than using the JSON of a GraphQL introspection query; this is sensible because the IDL should always be parseable and complete, whereas the introspection query may differ for different tools due to the flexibility of GraphQL's query language. It's important to consider both how this metadata will be exposed through GraphQL introspection (e.g. via the proposed "extensions" fields) and through the IDL (e.g. we're proposing here to use the directive syntax to expose this metadata in IDL, but we could come up with a different IDL representation for it if it's determined that using directive syntax for this does not make sense). Personally, I'm a fan of using directive syntax for this, because it already has form with deprecationReason being exposed as the @deprecated tag, though I do worry about how to differentiate user-defined metadata with future GraphQL official metadata.

Benedikt Franke · Answer 61 · Tue Oct 27 2020 00:03:16 GMT+0800 (China Standard Time)

@xuorig thanks for pointing that out, let me rectify my statement.

@benjie makes an interesting point, I agree that symmetry between the introspection result and an SDL representation thereof is nice. There are some definite advantages to being able to express metadata through the existing concept of directives.

Still, I think it is noteworthy to highlight the subtle difference between the directives used for schema definition (e.g. in SDL source code) and directives in the introspection result (e.g. the user facing schema, which may be a transformed version of the original SDL). There does not have to be a 1-1 relationship between those. As mentioned, some directives may be internal only, others may be external only. This is not really a concern of the spec, just something for implementors to be a aware of.

mmatsa · Answer 62 · Thu Oct 29 2020 23:46:44 GMT+0800 (China Standard Time)

@xuorig wrote:

For examples, schema-first frameworks could have a strict allowList of directives that would make sure no directive is ever leaked implicitly.

Instead of an allowList of directives, if we're proposing a change to the spec, how about putting it directly in the Schema (regardless of whether the Schema is from SDL or built-up programmatically) - on each directive definition.

It could work just like repeatable on the directive definition - there could be a new keyword like exposed that would mean that every use of this directive is exposed in the introspection result. By default it is false, therefore no change to any introspection result. If you add a new directive that you feel is intended for clients to see then you use the new keyword in your directive definition and all uses of that directive go to clients via introspection.

Using the example of @important from this issue (see top):

directive @important exposed on FIELD_DEFINITION
directive @statusquo on FIELD_DEFINITION


type User {
  id: ID!
  name: String @statusquo
  admin: Boolean! @important
}

type Query {
  user: User
}

Since @statusquo is an old directive that doesn't have the new exposed keyword, it is backwards-compatible and doesn't show up in introspection.

Since @important is marked as exposed it is shown on the introspection result everywhere - in this case on the admin field.

@spawnia wrote:

As mentioned, some directives may be internal only, others may be external only.

Basically the question is whether we should add a new boolean to distinguish those.

Ivan Maximov · Answer 63 · Fri Oct 30 2020 00:01:10 GMT+0800 (China Standard Time)

@mmatsa Your proposal makes sense. But it doesn't change in any way the simple fact that it still requires to extend introspection schema to request directives applied to the field (or to any other schema element). The client should be able to query these directives.

Marc-Andre Giroux · Answer 64 · Fri Oct 30 2020 00:52:25 GMT+0800 (China Standard Time)

Yeah, I think the exposed concept can be a whole different issue if the spec chooses to expose a directives property on the introspection. The specification could start with that change, libraries could handle private directives in "user land", try out approaches, and make a spec change if something great comes up.

In the mean time, another approach if the SDL is truly the only structure that can expose directives at the moment would be to handle that in something like https://github.com/graphql/graphql-over-http. At GitHub our API allows an SDL content-type (application/vnd.github.v4.idl) that will return the schema in SDL format. If we had a convention tools could at least work with a standardized endpoint/content type to fetch this information.

Still... having it in the introspection would be much better 😍 .

Ivan Maximov · Answer 65 · Fri Oct 30 2020 01:00:30 GMT+0800 (China Standard Time)

having it in the introspection would be much better

100% better :)

SDL or introspection are simply different ways of expressing the same information. It is strange if information about directives for some reason is available only in one form of presentation. I cannot think of a valid argument why this information should be hidden in introspection (it may be hidden though). In general, I'm 95% sure that the whole problem boils down to backward compatibility with existing tooling.

Marc-Andre Giroux · Answer 66 · Fri Oct 30 2020 01:03:19 GMT+0800 (China Standard Time)

SDL or introspection are simply different ways of expressing the same information.

💯

In general, I'm 95% sure that the whole problem boils down to backward compatibility with existing tooling.

It appears you've looked into this more than I am, with the graphql-playground example. The directives change appears to be an additive change, a change that should rarely break existing clients. Is it possible there wouldn't be that much work to make things backward compatible? We could work with different tools on a case by case basis to make them forward-compatible. It seems odd that adding a field would break clients, that's contrary to something that is core to GraphQL.

Ivan Maximov · Answer 67 · Fri Oct 30 2020 01:08:21 GMT+0800 (China Standard Time)

appears to be an additive change

:) yes, additive, but GraphQL Playground blows up

Is it possible there wouldn't be that much work to make things backward compatible?

Maybe yes, maybe now. Who knows? Only GraphQL Playground authors.

It seems odd that adding a field would break clients, that's contrary to something that is core to GraphQL.

Indeed, contrary :) but this is the reality. IIRC GraphQL Altair was more resistant to additional data in introspection and just showed a warning in UI. I haven't used it for a long time though.

Marc-Andre Giroux · Answer 68 · Fri Oct 30 2020 01:12:56 GMT+0800 (China Standard Time)

I guess where I'm going with this is that this feels like such an important change that it would be worth considering it even if it risks breaking some tools. There can be a good lead time, announcements, and in the worst case breaking older integrations might be worth it. I doubt this would be a dangerous change for most tools, but it would be nice to take a look at the data / check out how introspection is used by all major tooling.

Ivan Maximov · Answer 69 · Fri Oct 30 2020 01:22:33 GMT+0800 (China Standard Time)

There is not enough time for all these improvements in GraphQL WG.

Benjie · Answer 70 · Fri Oct 30 2020 02:06:03 GMT+0800 (China Standard Time)

if the SDL is truly the only structure that can expose directives at the moment would be to handle that in something like graphql/graphql-over-http

The problem with this approach is backwards compatibility; as we add new syntax to the SDL (e.g. inputUnion or tagged keywords, and related syntax for definitions) existing parsers would baulk because their parsers would not support these features. This differs to introspection, as generally introspection can be changed in an additive way that works with existing clients but allows newer clients to leverage more features (though might require the two-phase introspection that @IvanGoncharov has been pushing for). The GraphQL introspection API has to remain the source of truth IMO.

Marc-Andre Giroux · Answer 71 · Fri Oct 30 2020 02:07:20 GMT+0800 (China Standard Time)

Absolutely agreed @benjie 😄 Happy to help push this forward whenever the time is right.

Edit:

The GraphQL introspection API has to remain the source of truth IMO.

While I 1000% agree with this conceptually, it's worth noting that this is less and less true in practice because of this issue. A lot of tooling operates on SDLs when they could instead use introspection 👍

Samuel · Answer 72 · Mon Nov 09 2020 10:39:27 GMT+0800 (China Standard Time)

IIRC GraphQL Altair was more resistant to additional data in introspection and just showed a warning in UI

Regarding this, Altair introspects the schema using the introspectionQuery and buildClientSchema that is provided by graphql-js, but also has a fallback for the older introspectionQuery and older buildClientSchema before changes were made so it supports GraphQL servers still using the older GraphQL version (and warns the user about that) but to a limited capacity since all the GraphQL client functionalities (autocomplete, hinting, etc) depend on the graphql library being able to parse the schema properly, which it can't.

So no, Altair doesn't explode but it would be preferable if the changes are made backward compatible 🙂

Ivan Maximov · Answer 73 · Mon Nov 09 2020 17:30:21 GMT+0800 (China Standard Time)

@imolorhe I'm just wondering if you can make support for additional fields by simply ignoring all unknown parts in the response? This would be a good example and would probably move the discussed issue of "extended" introspection support off the ground.

Ivan Maximov · Answer 74 · Tue Feb 02 2021 05:38:55 GMT+0800 (China Standard Time)

Done in GraphQL.NET v4 (will be released in a week).

Brad Baker · Answer 75 · Fri Feb 26 2021 10:08:08 GMT+0800 (China Standard Time)

The graphql-java team is adding support for directives (and their argument values)

graphql-java/graphql-java#2221

Its opt in (you need to transform the base schema to make it available)

We have aped the "extensions" field idea from results and errors like this

type __InputValue {
  defaultValue: String
  description: String
  extensions: __InputValueExtensions
  name: String!
  type: __Type!
}
type __InputValueExtensions {
  directives: [__Directive!]!
  value: String
}

The used cases we have for exposing directives via Introspection is documentation generation>

eg imagine an @oauthscope(value="read:stuff") directive on a field - how would a consumer know that need "read:stuff" scope for that field if they can never say that in documentation. We plan to use "auto generated doc enhancement" based on introspection to do this (at Atlassian where I work).

Brad Baker · Answer 76 · Tue Mar 02 2021 20:02:05 GMT+0800 (China Standard Time)

After much internal discussions we have decided to use the shapes as as specified by the .Net graphql implementation as the basis for the graphql-java implementation.

We hope that this consistency helps the graphql world in the absence of a proper graphql specification mechanism for getting applied directives on schema elements.

See graphql-java/graphql-java#2221

Brad Baker · Answer 77 · Fri Mar 26 2021 16:51:57 GMT+0800 (China Standard Time)

😢

The graphql-java team and both major .Net graphql teams came up with a common implementation around a __AppliedDirective type in introspection.

However the graphql-js reference implementation pretty much rules this out.

See graphql-java/graphql-java#2221 (comment) for more details

In server terms it does not matter but what we saw was client side type generation failures (because the cli tools load the schema via graphql-js code)

So if the spec ever decided to update itself to some new __Foo naming - then there is a code time bomb out there in the world's most popular graphql implementation.

Benjamin Rood · Answer 78 · Fri Mar 26 2021 20:43:19 GMT+0800 (China Standard Time)

The first part of the solution is for the graphql-js implementation to not arbitrarily claim `__Foo` naming for schema introspection. The second part is for the schema spec to define a common standard covering this stuff, rather than deferring making a decision. Just decide, people. Just do it.

…

On Fri, 26 Mar 2021 at 10:52, Brad Baker ***@***.***> wrote: 😢 The graphql-java team and both major .Net graphql teams came up with a common implementation around a __AppliedDirective type in introspection. However the graphql-js reference implementation pretty much rules this out. See graphql-java/graphql-java#2221 (comment) <graphql-java/graphql-java#2221 (comment)> for more details In server terms it does not matter but what we saw was client side type generation failures (because the cli tools load the schema via graphql-js code) So if the spec ever decided to update itself to some new __Foo naming - then there is a code time bomb out there in the world's most popular graphql implementation. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#300 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACNCK5DKMRKBK5DROK4AI2TTFRDMBANCNFSM4DJL6RJQ> .

Shane Krueger · Answer 79 · Mon Apr 12 2021 01:02:54 GMT+0800 (China Standard Time)

So if the spec ever decided to update itself to some new __Foo naming - then there is a code time bomb out there in the world's most popular graphql implementation.

Isn't this a bug in the graphql-js implementation? I mean, the spec doesn't say that additional introspection types will not ever be defined, but simply that __ is reserved for introspection types. So client/tooling implementations should respect that for any fields and type that start with __. Am I missing something? Similarly, #649 added a new field to an introspection type. This should not cause clients to crash any more than adding a new introspection type should.

And then, does this affect typical client scenarios or just tooling? It would seem to me that typical clients are requesting data given a predefined query (not introspection fields) and expecting a certain format response. This does not change. It's only if they request (and incorrectly interpret) an introspection request, which is typically done during development or debugging. As tooling is generally brought up to date as development continues, that does not seem to be an major blocking issue either.
Of course, it will require graphql-js to fix the bug in their code soon.

The first part of the solution is for the graphql-js implementation to not arbitrarily claim __Foo naming for schema introspection.

I would say the first step is for the graphql-js implementation to consider ANY types/fields that start with __ as introspection types/fields, according to the current spec. This should be considered a bug fix.

The spec could also clarify this point immediately, specifying that additional introspection types and fields may be added in the future. You wouldn't think this would be necessary, as a major point of GraphQL is to be able to add types/fields to a schema without changing existing code...

Ivan Maximov · Answer 80 · Mon Apr 12 2021 19:39:20 GMT+0800 (China Standard Time)

@leebyron @benjie According to the activity in this repository it seemed to me that you are preparing the release of the next version of the specification. Please think about adding into the new version of at least suggestions to more clearly determine the ability to expand the introspection described above.

Benjie · Answer 81 · Wed Apr 14 2021 02:51:25 GMT+0800 (China Standard Time)

@sungam3r That is definitely out of scope for this current release; it's currently RFC 0. It will need to progress via the usual processes (i.e. being reviewed and having its stage advanced at GraphQL Working Groups) before it can be merged into the GraphQL spec.

Benjie · Answer 82 · Fri May 06 2022 22:21:00 GMT+0800 (China Standard Time)

@benjie, 23rd October 2020:

As for planning to work on it... I can't commit to anything right now (years or months); I've taken on far too many responsibilities and need to see some of those through before I can think about other things.

I'm planning to start working on this proposal again soon, and perhaps even become the champion of it. It might be a few months before you see anything out of this, but my initial plan is to gather together the various use cases and solutions into an RFC document so that we can analyse the status quo, and then present this to the WG. I do have another couple of RFCs that I'd like to advance in parallel (including #825 which advanced to RFC2 last night) so apologies if progress is slow 😉

mmatsa · Answer 83 · Fri May 06 2022 23:26:45 GMT+0800 (China Standard Time)

@benjie - Thanks! On my team we've gotten further in our implementation of it, quite slowly, busy like you. When you're ready to start working on it again, if we're available at the same time, we would be happy to collaborate. If we get somewhere first, we can update that here. As you said about yourself, we have too many other responsibilities to commit in advance, so if you have time and we don't then of course please continue without us - we'd be excited for you to get this in.

Benjie · Answer 84 · Sat May 28 2022 20:47:06 GMT+0800 (China Standard Time)

I will be talking about this (remotely 😞) at GraphQL Conf 2022, my talk "Schema Metadata" is currently expected to be "Tuesday June 7, 2022 2:50pm - 3:30pm CDT."

Michael Staib · Answer 85 · Sun May 29 2022 15:08:52 GMT+0800 (China Standard Time)

@benjie would be great to touch base on this RFC, GraphQL-Java and HotChocolate have implemented this and it could be great to have a zoom call with you, andy, and me.

Benjie · Answer 86 · Sun May 29 2022 15:36:56 GMT+0800 (China Standard Time)

Absolutely! (Also I mention this fact in my talk 👍)

Ivan Goncharov · Answer 87 · Sun May 29 2022 17:18:02 GMT+0800 (China Standard Time)

@benjie Thanks for bringing the conversation back, definitely the use case we need to solve.
It is also very important for Apollo and we discussed it internally.
I have an alternative proposal that I'm working on, it doesn't directly depend on directives.
It is still a rough proposal but I want to include it in a conversation so I will push spec PR in the next few days and we can discuss it on WG.

Shane Krueger · Answer 88 · Sun May 29 2022 22:46:29 GMT+0800 (China Standard Time)

@benjie would be great to touch base on this RFC, GraphQL-Java and HotChocolate have implemented this and it could be great to have a zoom call with you, andy, and me.

GraphQL.NET also has an implementation

Chris Newhouse · Answer 89 · Sun May 29 2022 23:12:33 GMT+0800 (China Standard Time)

I've been (eagerly) watching this issue since I found it while researching the idea of adding metadata to a GraphQL schema. My use case / intention was to utilize the metadata for the purpose of assisting the automated generation of GraphQL API documentation. For example, the metadata is used to provide "example" data for a field or type, or to add flags to indicate whether a particular thing should actually not be documented, etc.

I ended up creating an Apollo Server plugin to add metadata to the Introspection Query results on their way out of the server...and the API of that plugin also allows it to be used to weave metadata into an existing Introspection Query response's JSON.

If you read this blog post announcing the project (called SpectaQL), it mentions that one important aspect of our own implementation was to keep the definitions and documentation (i.e. "metadata" and descriptions) as close to one anther as possible.

While our implementation does not actually leverage directives or SDL for the metadata, it would be great if an officially supporting implementation allowed for this characteristic as well. That way developers could easily adjust the metadata when they create or update the schema definition, and downstream consumers (like SpectaQL) would just pick up those changes.

I thought it was worth chiming in at this moment...thank you for all your work on this idea!!!

Benjie · Answer 90 · Wed Jun 08 2022 19:20:27 GMT+0800 (China Standard Time)

(For anyone who missed my talk on schema metadata at GraphQL Conf and wanted to catch up on what I see as the main proposals and why this is a challenging problem, here's a rehearsal video: https://youtu.be/c1oa7p73rTw)

Benjie · Answer 91 · Thu Jun 09 2022 01:32:38 GMT+0800 (China Standard Time)

Mostly as a note to myself (sorry if it's a bit incoherent) my current feeling (having had a couple chats since the event) is:

Extensible meta types (object types, with restrictions)
Meta types must not form cycles (perhaps like input objects, perhaps stricter)
Meta types must not have arguments on their fields
When a non-scalar type T is referenced (including transitively) in a meta type, T must not be referenced in the main (non-introspection) schema (i.e. no overlap in composite/abstract type names)
Metadata is static data that should be known at schema build time (it will not change)

Potential SDL syntax: ?{ ... } after field directives; e.g.

type User {
  id: ID!

  """Description"""
  friends(first: Int): [User!] @connection ?{
    companyName_ownership: {
      currentOwner: "Alice"
      ownerSince: "2022-01-01T00:00:00Z"
      previousOwners: ["Bob", "Caroline", "Dave"]
    }
    graphile_database: {
      __typename: "_MetaPostgresDatabase" # Present if `graphile_database` has an abstract type
      schema: "public"
      table: "users"
    }
    relay_pagination: {
      maxFetch: 100
    }
    companyName_history: {
      lastEdited: "2022-06-08T18:20:00Z"
      editedBy: "Benjie"
    }
  }
}

Best practice: every key under the metadata should have an object as the value to act as a namespace and to allow for future expansion. These keys should involve the company or project name to help avoid conflicts.

Note: the use of ?{ is just an example; it could equally be +{ or + meta { or %meta(data: {...}). There are two important parts in my opinion:

it must be a symbol so that we can do unambiguous negative lookahead
I think we should use { to represent the metadata as an object.

I need to explore adding this to types and other locations; the {} + {} syntax may be undesirable there.

Glen · Answer 92 · Sun Jun 19 2022 23:17:16 GMT+0800 (China Standard Time)

What about putting the metadata in front of the element, similar to attributes/annotations?

type User {
  id: ID!

  """Single-line"""
  <% relay: { pagination: { maxFetch: 100 }, something: { else: "value" } %>
  friends(first: Int): [User!] @connection
  
  """Multi-line"""
  <%
    relay: {
      pagination: {
        maxFetch: 100
      }
    }
    something: {
      else: "value"
    }
  %>
  friends(first: Int): [User!] @connection
}

Carlo Corradini · Answer 93 · Fri Oct 07 2022 20:10:45 GMT+0800 (China Standard Time)

Any update on this? 😥

Michael Bleigh · Answer 94 · Fri Jan 05 2024 08:46:49 GMT+0800 (China Standard Time)

Skimming through this megathread (sorry, couldn't read every post) I didn't see a proposal for what I'm looking for here. Fundamentally I believe introspection should be a way to effectively reconstruct an equivalent picture of the SDL source through a programmatic API. This is the case currently for everything except the application of directives, which is strange when you consider that the directives themselves are included.

Use Case

I have a validation directive that defines validation logic for a field:

directive @validate(
  regex: String,
  minLength: Int,
  maxLength: Int,
  message: String
) repeatable on FIELD_DEFINITION

My GraphQL server recognizes this directive and appropriately validates inputs. However, I'm also generating client libraries and when a validated field is provided in an input I want to offer client-side validation. If directive applications were included in introspection, I could scan fields for validation rules and add them to my client side logic, offering me a great experience because I can do "live" local validation instead of having to call the server.

Today, this isn't possible unless I have access to the raw SDL file because even though I can understand the @validate directive exists, I can't see where it's applied.

Proposal

Add a new __DirectiveApplication to introspection and add it to the introspection data of every place where directives can be applied:

type __DirectiveApplication {
  "The name of the directive applied at this location."
  directive: String!
  "The location of this directive application."
  location: __DirectiveLocation
  "The arguments for this directive application."
  args: [__DirectiveApplicationArgumentValue]
}

"Arguments provided to Fields or Directives and the input fields of an InputObject are represented as Input Values which describe their type and optionally a default value."
type __DirectiveApplicationArgumentValue { # needs a better name
  name: String!
  type: __Type!
  "A GraphQL-formatted string representing value for this directive argument."
  value: String
}

extend type __Type {
  directives: [DirectiveApplication!]
}

extend type __Field {
  directives: [DirectiveApplication!]
}

extend type __Schema {
  directives: [DirectiveApplication!]
}

extend type __InputValue {
  directives: [DirectiveApplication!]
}

# ...etc if I'm forgetting any

Rather than inventing a whole new syntax or complex mechanism for schema annotations, just provide the raw information of which directive was applied at the place it's applied and tooling can sort it out.

The only "ugly" part of this in my opinion is the value being a GraphQL formatted string since GQL doesn't have an Any type, but this is already present in introspection for defaultValue on __InputValue.

Am I missing something obvious as to why this isn't reasonable/feasible?

Note: The RFC for annotations from @benjie does solve this problem as well, and even more robustly, but I worry that generating per-directive struct types might make it a hard sell vs. something that can more easily be integrated with existing systems.

Benjie · Answer 95 · Fri Jan 05 2024 18:12:15 GMT+0800 (China Standard Time)

One issue with the defaultValue stringified literal approach is that it requires a GraphQL-capable parser to make sense of the data (JSON.parse() is not sufficient, and adding a parser will increase bundle size for web clients), and even then that's not sufficient since custom scalars can have their own parse/deparse rules which are not encoded via the schema. There's @specifiedBy but that currently doesn't provide a machine-readable parser. defaultValue itself also suffers this issue, but it's extremely rare that any application client will actually use the defaultValue for anything, typically it's just rendered in docs/GraphiQL/etc and the fact or "has default" is enough for most application logic.

I definitely agree that sharing validation logic is one of the use cases of metadata directives 👍

Martin Bonnin · Answer 96 · Fri Jan 05 2024 18:24:19 GMT+0800 (China Standard Time)

Has there been any talks about modeling GraphQL values as JSON objects in introspection?

GraphQL value:

{ a: [0, 1, 2], b: 3.14, c: "foo" }

Introspection result:

{
  "__typename": "__ObjectValue",
  "value": {
    "a": {
      "__typename": "__ListValue",
      "value": [
        { "__typename":  "__IntValue", "value":  0},
        { "__typename":  "__IntValue", "value":  1},
        { "__typename":  "__IntValue", "value":  2}
      ]
    },
    "b": {
      "__typename": "__FloatValue",
      "value": 3.14
    },
    "c": {
      "__typename": "__StringValue",
      "value": "foo"
    }
  }
}

Sure it's verbose but JSON is already verbose so it might be ok? And clients that don't need the actual values but just the presence of a value could just not request it.

Benjie · Answer 97 · Fri Jan 05 2024 18:45:44 GMT+0800 (China Standard Time)

@martinbonnin You're essentially talking about "boxed types" here, or, arguably, an AST with type annotations.

I hope that the struct RFC solves this in a more elegant fashion: https://github.com/graphql/graphql-wg/blob/main/rfcs/Struct.md

Essentially it allows the defaultValue of { a: [0, 1, 2], b: 3.14, c: "foo" } to be output as the JSON object { "a": [0, 1, 2], "b": 3.14, "c": "foo" } and clients can still interpret it type-safely since they know the underlying types from the schema. This is much more GraphQL-y than an AST approach IMO, since it's more what the client would desire to deal with.

Martin Bonnin · Answer 98 · Fri Jan 05 2024 19:37:15 GMT+0800 (China Standard Time)

@benjie structs looks great, thanks for the link 👍.

Back to @mbleigh question, I'm going to link @benjie excellent video that discusses exactly that issue: https://youtu.be/c1oa7p73rTw:

timestamp for __DirectiveApplication
timestamp for InputValidation

My understanding is that structs would alleviate most of the drawbacks there so looking forward to them :)!