Support Pydantic 2.0

Question

Support Pydantic 2.0

xl0 opened this issue a year ago · comments

Pydantic 2.0 removed create_model_from_typeddict() without a warning, which breaks pyairtable.

Can't say I agree with them just yanking functions without a deprecation warning, but that's on them.

I never used TypedDict. Is there a reason to use it instead of BaseModel, since we already depend on pydantic? @mesozoic , are you ok if I move the types to BaseModel?

Alex L. · Answer 1 · Wed Jul 12 2023 19:39:27 GMT+0800 (China Standard Time)

We use TypedDict to represent record and field dicts that get returned by Airtable. I don't think it's always practical for us to convert those into models, and it would be a significant change for library users whose code expects those objects as dicts. (Maybe in 3.0)

I do think there's a way to make this library work with both v1 and v2 (it involves conditional imports) but I think we can take our time doing it. As you pointed out, Pydantic 2 made a lot of breaking API changes, so I doubt many people are in a hurry to switch.

(FWIW I do have a couple branches in the works which use BaseModel for net new data structures, like comments and webhooks. Will post those soon.)

Alexey Zaytsev · Answer 2 · Wed Jul 12 2023 20:55:31 GMT+0800 (China Standard Time)

I see.

While the TypedDict code has not been around for a pyairtable release yet, people do expect something that behaves like a dict, right?

Would you entertain the idea of returning a subclasses of

class DictBaseModel(BaseModel):
    def __getitem__(self, key):
        return getattr(self, key)

instead of TypedDict?

Alex L. · Answer 3 · Wed Jul 19 2023 01:37:33 GMT+0800 (China Standard Time)

I think we can entertain any idea as long as it demonstrates value. Does this change help us add any new features or capabilities to the library?

Alexey Zaytsev · Answer 4 · Mon Jul 24 2023 14:03:21 GMT+0800 (China Standard Time)

I feel the main advantage is, Pydantic is a lot more popular than TypedDict, which means that

Everyone knows how to use it.
Oher libraries and tools integrate well with it.

Since PyAirtable already depends on Pydantic internally, it would make sense to also use it for the interface.
Pydantic is probably going to be the way forward for ORM. It would be confusing if part of PyAirtable interface is in Pydantic, and another part is in TypedDict.

And now would be a good time for this switch, before the TypedDict implementation hit a release. I think the change would be very straightforward. If you feel like it's worth it, I can implement it.

I have not used TypedDict before, so I don't know its limitations compared to Pydantic.

Alex L. · Answer 5 · Tue Aug 01 2023 05:41:16 GMT+0800 (China Standard Time)

TypedDict is part of the Python standard library; you can read more about it here.

I don't think it's confusing for parts of the pyAirtable API, like table.all(), to return dict, especially when those data structures are more or less "raw" passthroughs of what the Airtable API returns itself. Every Python developer knows how to deal with a dict and understands their behavior. Far fewer are familiar with Pydantic models and their quirks.

At this point the primary argument for not changing the return type of table.all() is for backwards compatibility. If we're going to replace that with Pydantic model instances (which implement __getitem__, maybe) then there's got to be some clear benefit in the form of new library capabilities. If there are specific examples of how this change might make it easier to integrate pyAirtable with other libraries, I'm all for hearing those.

For now I think the scope of this issue should be limited to supporting both Pydantic v1 and v2 (so we can minimize dependency conflicts) or, if that proves impossible, migrating to Pydantic v2 once it's clear that the older version of the library is no longer the dominant dependency (but we're not there yet).

Alex L. · Answer 6 · Fri Aug 11 2023 12:32:38 GMT+0800 (China Standard Time)

Resolved in #288 🙌