2006-06-29 00:00:37 +08:00
|
|
|
==========================
|
|
|
|
Serializing Django objects
|
|
|
|
==========================
|
|
|
|
|
|
|
|
.. note::
|
2006-09-26 01:44:07 +08:00
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
This API is currently under heavy development and may change --
|
|
|
|
perhaps drastically -- in the future.
|
2006-09-26 01:44:07 +08:00
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
You have been warned.
|
2006-09-26 01:44:07 +08:00
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
Django's serialization framework provides a mechanism for "translating" Django
|
|
|
|
objects into other formats. Usually these other formats will be text-based and
|
|
|
|
used for sending Django objects over a wire, but it's possible for a
|
|
|
|
serializer to handle any format (text-based or not).
|
|
|
|
|
|
|
|
Serializing data
|
|
|
|
----------------
|
|
|
|
|
|
|
|
At the highest level, serializing data is a very simple operation::
|
|
|
|
|
|
|
|
from django.core import serializers
|
|
|
|
data = serializers.serialize("xml", SomeModel.objects.all())
|
2006-09-26 01:44:07 +08:00
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
The arguments to the ``serialize`` function are the format to serialize the
|
|
|
|
data to (see `Serialization formats`_) and a QuerySet_ to serialize.
|
|
|
|
(Actually, the second argument can be any iterator that yields Django objects,
|
|
|
|
but it'll almost always be a QuerySet).
|
|
|
|
|
2007-04-24 13:58:03 +08:00
|
|
|
.. _QuerySet: ../db-api/#retrieving-objects
|
2006-06-29 00:00:37 +08:00
|
|
|
|
|
|
|
You can also use a serializer object directly::
|
|
|
|
|
2007-03-22 20:54:23 +08:00
|
|
|
XMLSerializer = serializers.get_serializer("xml")
|
|
|
|
xml_serializer = XMLSerializer()
|
2006-06-29 00:00:37 +08:00
|
|
|
xml_serializer.serialize(queryset)
|
|
|
|
data = xml_serializer.getvalue()
|
2006-09-26 01:44:07 +08:00
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
This is useful if you want to serialize data directly to a file-like object
|
|
|
|
(which includes a HTTPResponse_)::
|
|
|
|
|
|
|
|
out = open("file.xml", "w")
|
|
|
|
xml_serializer.serialize(SomeModel.objects.all(), stream=out)
|
|
|
|
|
|
|
|
.. _HTTPResponse: ../request_response/#httpresponse-objects
|
|
|
|
|
2007-06-01 21:39:08 +08:00
|
|
|
Subset of fields
|
|
|
|
~~~~~~~~~~~~~~~~
|
|
|
|
|
2007-11-04 10:08:24 +08:00
|
|
|
If you only want a subset of fields to be serialized, you can
|
2007-07-01 09:00:23 +08:00
|
|
|
specify a ``fields`` argument to the serializer::
|
2007-06-01 21:39:08 +08:00
|
|
|
|
|
|
|
from django.core import serializers
|
|
|
|
data = serializers.serialize('xml', SomeModel.objects.all(), fields=('name','size'))
|
|
|
|
|
2007-07-01 09:00:23 +08:00
|
|
|
In this example, only the ``name`` and ``size`` attributes of each model will
|
2007-11-04 10:08:24 +08:00
|
|
|
be serialized.
|
2007-06-01 21:39:08 +08:00
|
|
|
|
|
|
|
.. note::
|
|
|
|
|
|
|
|
Depending on your model, you may find that it is not possible to deserialize
|
2007-06-02 01:09:54 +08:00
|
|
|
a model that only serializes a subset of its fields. If a serialized object
|
2007-06-01 21:39:08 +08:00
|
|
|
doesn't specify all the fields that are required by a model, the deserializer
|
|
|
|
will not be able to save deserialized instances.
|
|
|
|
|
2008-06-09 22:03:35 +08:00
|
|
|
Inherited Models
|
|
|
|
~~~~~~~~~~~~~~~~
|
|
|
|
|
|
|
|
If you have a model that is defined using an `abstract base class`_, you don't
|
|
|
|
have to do anything special to serialize that model. Just call the serializer
|
|
|
|
on the object (or objects) that you want to serialize, and the output will be
|
|
|
|
a complete representation of the serialized object.
|
|
|
|
|
|
|
|
However, if you have a model that uses `multi-table inheritance`_, you also
|
|
|
|
need to serialize all of the base classes for the model. This is because only
|
|
|
|
the fields that are locally defined on the model will be serialized. For
|
|
|
|
example, consider the following models::
|
|
|
|
|
|
|
|
class Place(models.Model):
|
|
|
|
name = models.CharField(max_length=50)
|
|
|
|
|
|
|
|
class Restaurant(Place):
|
|
|
|
serves_hot_dogs = models.BooleanField()
|
|
|
|
|
|
|
|
If you only serialize the Restaurant model::
|
|
|
|
|
|
|
|
data = serializers.serialize('xml', Restaurant.objects.all())
|
|
|
|
|
|
|
|
the fields on the serialized output will only contain the `serves_hot_dogs`
|
|
|
|
attribute. The `name` attribute of the base class will be ignored.
|
|
|
|
|
|
|
|
In order to fully serialize your Restaurant instances, you will need to
|
|
|
|
serialize the Place models as well::
|
|
|
|
|
|
|
|
all_objects = list(Restaurant.objects.all()) + list(Place.objects.all())
|
|
|
|
data = serializers.serialize('xml', all_objects)
|
|
|
|
|
|
|
|
.. _abstract base class: http://www.djangoproject.com/documentation/model-api/#abstract-base-classes
|
|
|
|
.. _multi-table inheritance: http://www.djangoproject.com/documentation/model-api/#multi-table-inheritance
|
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
Deserializing data
|
|
|
|
------------------
|
|
|
|
|
|
|
|
Deserializing data is also a fairly simple operation::
|
|
|
|
|
|
|
|
for obj in serializers.deserialize("xml", data):
|
|
|
|
do_something_with(obj)
|
2006-09-26 01:44:07 +08:00
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
As you can see, the ``deserialize`` function takes the same format argument as
|
|
|
|
``serialize``, a string or stream of data, and returns an iterator.
|
|
|
|
|
|
|
|
However, here it gets slightly complicated. The objects returned by the
|
|
|
|
``deserialize`` iterator *aren't* simple Django objects. Instead, they are
|
|
|
|
special ``DeserializedObject`` instances that wrap a created -- but unsaved --
|
|
|
|
object and any associated relationship data.
|
|
|
|
|
|
|
|
Calling ``DeserializedObject.save()`` saves the object to the database.
|
|
|
|
|
|
|
|
This ensures that deserializing is a non-destructive operation even if the
|
|
|
|
data in your serialized representation doesn't match what's currently in the
|
|
|
|
database. Usually, working with these ``DeserializedObject`` instances looks
|
|
|
|
something like::
|
|
|
|
|
|
|
|
for deserialized_object in serializers.deserialize("xml", data):
|
|
|
|
if object_should_be_saved(deserialized_object):
|
2008-02-19 07:43:12 +08:00
|
|
|
deserialized_object.save()
|
2006-09-26 01:44:07 +08:00
|
|
|
|
2006-06-29 00:00:37 +08:00
|
|
|
In other words, the usual use is to examine the deserialized objects to make
|
|
|
|
sure that they are "appropriate" for saving before doing so. Of course, if you trust your data source you could just save the object and move on.
|
|
|
|
|
|
|
|
The Django object itself can be inspected as ``deserialized_object.object``.
|
|
|
|
|
|
|
|
Serialization formats
|
|
|
|
---------------------
|
|
|
|
|
2006-06-30 00:42:49 +08:00
|
|
|
Django "ships" with a few included serializers:
|
2006-06-29 00:00:37 +08:00
|
|
|
|
2006-06-30 00:42:49 +08:00
|
|
|
========== ==============================================================
|
|
|
|
Identifier Information
|
|
|
|
========== ==============================================================
|
|
|
|
``xml`` Serializes to and from a simple XML dialect.
|
|
|
|
|
|
|
|
``json`` Serializes to and from JSON_ (using a version of simplejson_
|
|
|
|
bundled with Django).
|
|
|
|
|
|
|
|
``python`` Translates to and from "simple" Python objects (lists, dicts,
|
2006-09-26 01:44:07 +08:00
|
|
|
strings, etc.). Not really all that useful on its own, but
|
2006-06-30 00:42:49 +08:00
|
|
|
used as a base for other serializers.
|
2007-11-04 10:08:24 +08:00
|
|
|
|
2008-05-24 13:15:21 +08:00
|
|
|
``yaml`` Serializes to YAML (YAML Ain't a Markup Language). This
|
2007-11-04 10:08:24 +08:00
|
|
|
serializer is only available if PyYAML_ is installed.
|
2006-06-30 00:42:49 +08:00
|
|
|
========== ==============================================================
|
|
|
|
|
|
|
|
.. _json: http://json.org/
|
|
|
|
.. _simplejson: http://undefined.org/python/#simplejson
|
2007-06-01 21:39:08 +08:00
|
|
|
.. _PyYAML: http://www.pyyaml.org/
|
2006-06-30 00:42:49 +08:00
|
|
|
|
2006-09-26 01:44:07 +08:00
|
|
|
Notes for specific serialization formats
|
2006-09-22 21:26:07 +08:00
|
|
|
----------------------------------------
|
|
|
|
|
|
|
|
json
|
|
|
|
~~~~
|
|
|
|
|
2006-09-26 01:44:07 +08:00
|
|
|
If you're using UTF-8 (or any other non-ASCII encoding) data with the JSON
|
2006-09-22 21:26:07 +08:00
|
|
|
serializer, you must pass ``ensure_ascii=False`` as a parameter to the
|
2006-09-26 01:44:07 +08:00
|
|
|
``serialize()`` call. Otherwise, the output won't be encoded correctly.
|
2006-09-22 21:26:07 +08:00
|
|
|
|
|
|
|
For example::
|
|
|
|
|
2007-05-08 11:07:58 +08:00
|
|
|
json_serializer = serializers.get_serializer("json")()
|
2006-09-22 21:26:07 +08:00
|
|
|
json_serializer.serialize(queryset, ensure_ascii=False, stream=response)
|
|
|
|
|
2007-12-02 00:58:52 +08:00
|
|
|
The Django source code includes the simplejson_ module. Be aware that if
|
|
|
|
you're serializing using that module directly, not all Django output can be
|
2007-11-04 10:08:24 +08:00
|
|
|
passed unmodified to simplejson. In particular, `lazy translation objects`_
|
|
|
|
need a `special encoder`_ written for them. Something like this will work::
|
|
|
|
|
|
|
|
from django.utils.functional import Promise
|
|
|
|
from django.utils.encoding import force_unicode
|
|
|
|
|
|
|
|
class LazyEncoder(simplejson.JSONEncoder):
|
|
|
|
def default(self, obj):
|
|
|
|
if isinstance(obj, Promise):
|
|
|
|
return force_unicode(obj)
|
|
|
|
return obj
|
|
|
|
|
|
|
|
.. _lazy translation objects: ../i18n/#lazy-translation
|
|
|
|
.. _special encoder: http://svn.red-bean.com/bob/simplejson/tags/simplejson-1.7/docs/index.html
|