In most cases Pydantic won't be your bottle neck, only follow this if you're sure it's necessary.
In general, use model_validate_json() not model_validate(json.loads(...))¶
On model_validate(json.loads(...)), the JSON is parsed in Python, then converted to a dict, then it's validated internally.
On the other hand, model_validate_json() already performs the validation internally.
There are a few cases where model_validate(json.loads(...)) may be faster. Specifically, when using a 'before' or 'wrap' validator
on a model, validation may be faster with the two step method. You can read more about these special cases in
this discussion.
Many performance improvements are currently in the works for pydantic-core, as discussed
here. Once these changes are merged, we should be at
the point where model_validate_json() is always faster than model_validate(json.loads(...)).
The idea here is to avoid constructing validators and serializers more than necessary. Each time a TypeAdapter is instantiated,
it will construct a new validator and serializer. If you're using a TypeAdapter in a function, it will be instantiated each time
the function is called. Instead, instantiate it once, and reuse it.
fromtypingimportListfrompydanticimportTypeAdapterdefmy_func():adapter=TypeAdapter(List[int])# do something with adapter
fromtypingimportListfrompydanticimportTypeAdapteradapter=TypeAdapter(List[int])defmy_func():...# do something with adapter
When using Sequence, Pydantic calls isinstance(value, Sequence) to check if the value is a sequence.
Also, Pydantic will try to validate against different types of sequences, like list and tuple.
If you know the value is a list or tuple, use list or tuple instead of Sequence.
The same applies to Mapping and dict.
If you know the value is a dict, use dict instead of Mapping.
Don't do validation when you don't have to - use Any to keep the value unchanged¶
If you don't need to validate a value, use Any to keep the value unchanged.
Avoid wrap validators if you really care about performance¶
Wrap validators are generally slower than other validators. This is because they require
that data is materialized in Python during validation. Wrap validators can be incredibly useful
for complex validation logic, but if you're looking for the best performance, you should avoid them.
Starting in v2.8+, you can apply the FailFast annotation to sequence types to fail early if any item in the sequence fails validation.
If you use this annotation, you won't get validation errors for the rest of the items in the sequence if one fails, so you're effectively
trading off visibility for performance.
fromtypingimportListfromtyping_extensionsimportAnnotatedfrompydanticimportFailFast,TypeAdapter,ValidationErrorta=TypeAdapter(Annotated[List[bool],FailFast()])try:ta.validate_python([True,'invalid',False,'also invalid'])exceptValidationErrorasexc:print(exc)""" 1 validation error for list[bool] 1 Input should be a valid boolean, unable to interpret input [type=bool_parsing, input_value='invalid', input_type=str] """