642 lines
22 KiB
Plaintext
642 lines
22 KiB
Plaintext
================================
|
|
The ``py.test`` tool and library
|
|
================================
|
|
|
|
.. contents::
|
|
.. sectnum::
|
|
|
|
|
|
This document is about the *usage* of the ``py.test`` testing tool. There is
|
|
also document describing the `implementation and the extending of py.test`_.
|
|
|
|
.. _`implementation and the extending of py.test`: impl-test.html
|
|
|
|
starting point: ``py.test`` command line tool
|
|
=============================================
|
|
|
|
First, see `getting started`_ for how to install the 'py.test' tool
|
|
on your system.
|
|
|
|
``py.test`` is the command line tool to run tests. You can supply it
|
|
with any Python module by passing it as an argument::
|
|
|
|
py.test test_sample.py
|
|
|
|
``py.test`` looks for any functions and methods in the module that
|
|
start with with ``test_`` and will then run those methods. Assertions
|
|
about test outcomes are done via the standard ``assert`` statement.
|
|
|
|
This means you can write tests without any boilerplate::
|
|
|
|
# content of test_sample.py
|
|
def test_answer():
|
|
assert 42 == 43
|
|
|
|
As you can see, you can have test functions as well as test
|
|
methods. This in contrast to the Python standard library's
|
|
``unittest.py``.
|
|
|
|
You can use ``py.test`` to run all tests in a directory structure by
|
|
invoking it without any arguments::
|
|
|
|
py.test
|
|
|
|
This will automatically collect and run any Python module whose filenames
|
|
start with ``test_`` or ends with ``_test`` from the directory and any
|
|
subdirectories, starting with the current directory, and run them. Each
|
|
Python test module is inspected for test methods starting with ``test_``.
|
|
|
|
.. _`getting started`: getting-started.html
|
|
|
|
|
|
.. _features:
|
|
|
|
Basic Features of ``py.test``
|
|
=============================
|
|
|
|
assert with the ``assert`` statement
|
|
------------------------------------
|
|
|
|
Writing assertions is very simple and this is one of py.test's
|
|
most noticeable features, as you can use the ``assert``
|
|
statement with arbitrary expressions. For example you can
|
|
write the following in your tests::
|
|
|
|
assert hasattr(x, 'attribute')
|
|
|
|
to state that your object has a certain ``attribute``. In case this
|
|
assertion fails the test ``reporter`` will provide you with a very
|
|
helpful analysis and a clean traceback.
|
|
|
|
Note that in order to display helpful analysis of a failing
|
|
``assert`` statement some magic takes place behind the
|
|
scenes. For now, you only need to know that if something
|
|
looks strange or you suspect a bug in that
|
|
*behind-the-scenes-magic* you may turn off the magic by
|
|
providing the ``--nomagic`` option.
|
|
|
|
how to write assertions about exceptions
|
|
----------------------------------------
|
|
|
|
In order to write assertions about exceptions, you use
|
|
one of two forms::
|
|
|
|
py.test.raises(Exception, func, *args, **kwargs)
|
|
py.test.raises(Exception, "func(*args, **kwargs)")
|
|
|
|
both of which execute the given function with args and kwargs and
|
|
asserts that the given ``Exception`` is raised. The reporter will
|
|
provide you with helpful output in case of failures such as *no
|
|
exception* or *wrong exception*.
|
|
|
|
|
|
automatic collection of tests on all levels
|
|
-------------------------------------------
|
|
|
|
The automated test collection process walks the current
|
|
directory (or the directory given as a command line argument)
|
|
and all its subdirectories and collects python modules with a
|
|
leading ``test_`` or trailing ``_test`` filename. From each
|
|
test module every function with a leading ``test_`` or class with
|
|
a leading ``Test`` name is collected. The collecting process can
|
|
be customized at directory, module or class level. (see
|
|
`collection process`_ for some implementation details).
|
|
|
|
.. _`generative tests`:
|
|
.. _`collection process`: impl-test.html#collection-process
|
|
|
|
generative tests: yielding more tests
|
|
-------------------------------------
|
|
|
|
*Generative tests* are test methods that are *generator functions* which
|
|
``yield`` callables and their arguments. This is most useful for running a
|
|
test function multiple times against different parameters.
|
|
Example::
|
|
|
|
def test_generative():
|
|
for x in (42,17,49):
|
|
yield check, x
|
|
|
|
def check(arg):
|
|
assert arg % 7 == 0 # second generated tests fails!
|
|
|
|
Note that ``test_generative()`` will cause three tests
|
|
to get run, notably ``check(42)``, ``check(17)`` and ``check(49)``
|
|
of which the middle one will obviously fail.
|
|
|
|
.. _`selection by keyword`:
|
|
|
|
selecting tests by keyword
|
|
--------------------------
|
|
|
|
You can selectively run tests by specifiying a keyword
|
|
on the command line. Example::
|
|
|
|
py.test -k test_simple
|
|
|
|
will run all tests that are found from the current directory
|
|
and where the word "test_simple" equals the start of one part of the
|
|
path leading up to the test item. Directory and file basenames as well
|
|
as function, class and function/method names each form a possibly
|
|
matching name.
|
|
|
|
Note that the exact semantics are still experimental but
|
|
should always remain intuitive.
|
|
|
|
testing with multiple python versions / executables
|
|
---------------------------------------------------
|
|
|
|
With ``--exec=EXECUTABLE`` you can specify a python
|
|
executable (e.g. ``python2.2``) with which the tests
|
|
will be executed.
|
|
|
|
|
|
testing starts immediately
|
|
--------------------------
|
|
|
|
Testing starts as soon as the first ``test item``
|
|
is collected. The collection process is iterative
|
|
and does not need to complete before your first
|
|
test items are executed.
|
|
|
|
no interference with cmdline utilities
|
|
--------------------------------------
|
|
|
|
As ``py.test`` mainly operates as a separate cmdline
|
|
tool you can easily have a command line utility and
|
|
some tests in the same file.
|
|
|
|
debug with the ``print`` statement
|
|
----------------------------------
|
|
|
|
By default, ``py.test`` catches text written to stdout/stderr during
|
|
the execution of each individual test. This output will only be
|
|
displayed however if the test fails; you will not see it
|
|
otherwise. This allows you to put debugging print statements in your
|
|
code without being overwhelmed by all the output that might be
|
|
generated by tests that do not fail.
|
|
|
|
Each failing test that produced output during the running of the test
|
|
will have its output displayed in the ``recorded stdout`` section.
|
|
|
|
The catching of stdout/stderr output can be disabled using the
|
|
``--nocapture`` option to the ``py.test`` tool. Any output will
|
|
in this case be displayed as soon as it is generated.
|
|
|
|
test execution order
|
|
--------------------------------
|
|
|
|
Tests usually run in the order in which they appear in the files.
|
|
However, tests should not rely on running one after another, as
|
|
this prevents more advanced usages: running tests
|
|
distributedly or selectively, or in "looponfailing" mode,
|
|
will cause them to run in random order.
|
|
|
|
useful tracebacks, recursion detection
|
|
--------------------------------------
|
|
|
|
A lot of care is taken to present nice tracebacks in case of test
|
|
failure. Try::
|
|
|
|
py.test py/documentation/example/pytest/failure_demo.py
|
|
|
|
to see a variety of 17 tracebacks, each tailored to a different
|
|
failure situation.
|
|
|
|
``py.test`` uses the same order for presenting tracebacks as Python
|
|
itself: the outer function is shown first, and the most recent call is
|
|
shown last. Similarly, a ``py.test`` reported traceback starts with your
|
|
failing test function and then works its way downwards. If the maximum
|
|
recursion depth has been exceeded during the running of a test, for
|
|
instance because of infinite recursion, ``py.test`` will indicate
|
|
where in the code the recursion was taking place. You can
|
|
inhibit traceback "cutting" magic by supplying ``--fulltrace``.
|
|
|
|
There is also the possibility of usind ``--tb=short`` to get the regular Python
|
|
tracebacks (which can sometimes be useful when they are extremely long). Or you
|
|
can use ``--tb=no`` to not show any tracebacks at all.
|
|
|
|
no inheritance requirement
|
|
--------------------------
|
|
|
|
Test classes are recognized by their leading ``Test`` name. Unlike
|
|
``unitest.py``, you don't need to inherit from some base class to make
|
|
them be found by the test runner. Besides being easier, it also allows
|
|
you to write test classes that subclass from application level
|
|
classes.
|
|
|
|
disabling a test class
|
|
----------------------
|
|
|
|
If you want to disable a complete test class you
|
|
can set the class-level attribute ``disabled``.
|
|
For example, in order to avoid running some tests on Win32::
|
|
|
|
class TestEgSomePosixStuff:
|
|
disabled = sys.platform == 'win32'
|
|
|
|
def test_xxx(self):
|
|
...
|
|
|
|
testing for deprecated APIs
|
|
------------------------------
|
|
|
|
In your tests you can use ``py.test.deprecated_call(func, *args, **kwargs)``
|
|
to test that a particular function call triggers a DeprecationWarning.
|
|
This is useful for testing phasing out of old APIs in your projects.
|
|
|
|
Managing test state across test modules, classes and methods
|
|
------------------------------------------------------------
|
|
|
|
Often you want to create some files, database connections or other
|
|
state in order to run tests in a certain environment. With
|
|
``py.test`` there are three scopes for which you can provide hooks to
|
|
manage such state. Again, ``py.test`` will detect these hooks in
|
|
modules on a name basis. The following module-level hooks will
|
|
automatically be called by the session::
|
|
|
|
def setup_module(module):
|
|
""" setup up any state specific to the execution
|
|
of the given module.
|
|
"""
|
|
|
|
def teardown_module(module):
|
|
""" teardown any state that was previously setup
|
|
with a setup_module method.
|
|
"""
|
|
|
|
The following hooks are available for test classes::
|
|
|
|
def setup_class(cls):
|
|
""" setup up any state specific to the execution
|
|
of the given class (which usually contains tests).
|
|
"""
|
|
|
|
def teardown_class(cls):
|
|
""" teardown any state that was previously setup
|
|
with a call to setup_class.
|
|
"""
|
|
|
|
def setup_method(self, method):
|
|
""" setup up any state tied to the execution of the given
|
|
method in a class. setup_method is invoked for every
|
|
test method of a class.
|
|
"""
|
|
|
|
def teardown_method(self, method):
|
|
""" teardown any state that was previously setup
|
|
with a setup_method call.
|
|
"""
|
|
|
|
The last two hooks, ``setup_method`` and ``teardown_method``, are
|
|
equivalent to ``setUp`` and ``tearDown`` in the Python standard
|
|
library's ``unitest`` module.
|
|
|
|
All setup/teardown methods are optional. You could have a
|
|
``setup_module`` but no ``teardown_module`` and the other way round.
|
|
|
|
Note that while the test session guarantees that for every ``setup`` a
|
|
corresponding ``teardown`` will be invoked (if it exists) it does
|
|
*not* guarantee that any ``setup`` is called only happens once. For
|
|
example, the session might decide to call the ``setup_module`` /
|
|
``teardown_module`` pair more than once during the execution of a test
|
|
module.
|
|
|
|
Experimental doctest support
|
|
------------------------------------------------------------
|
|
|
|
If you want to integrate doctests, ``py.test`` now by default
|
|
picks up files matching the ``test_*.txt`` or ``*_test.txt``
|
|
patterns and processes them as text files containing doctests.
|
|
This is an experimental feature and likely to change
|
|
its implementation.
|
|
|
|
Working Examples
|
|
================
|
|
|
|
Example for managing state at module, class and method level
|
|
------------------------------------------------------------
|
|
|
|
Here is a working example for what goes on when you setup modules,
|
|
classes and methods::
|
|
|
|
# [[from py/documentation/example/pytest/test_setup_flow_example.py]]
|
|
|
|
def setup_module(module):
|
|
module.TestStateFullThing.classcount = 0
|
|
|
|
class TestStateFullThing:
|
|
def setup_class(cls):
|
|
cls.classcount += 1
|
|
|
|
def teardown_class(cls):
|
|
cls.classcount -= 1
|
|
|
|
def setup_method(self, method):
|
|
self.id = eval(method.func_name[5:])
|
|
|
|
def test_42(self):
|
|
assert self.classcount == 1
|
|
assert self.id == 42
|
|
|
|
def test_23(self):
|
|
assert self.classcount == 1
|
|
assert self.id == 23
|
|
|
|
def teardown_module(module):
|
|
assert module.TestStateFullThing.classcount == 0
|
|
|
|
For this example the control flow happens as follows::
|
|
|
|
import test_setup_flow_example
|
|
setup_module(test_setup_flow_example)
|
|
setup_class(TestStateFullThing)
|
|
instance = TestStateFullThing()
|
|
setup_method(instance, instance.test_42)
|
|
instance.test_42()
|
|
setup_method(instance, instance.test_23)
|
|
instance.test_23()
|
|
teardown_class(TestStateFullThing)
|
|
teardown_module(test_setup_flow_example)
|
|
|
|
|
|
Note that ``setup_class(TestStateFullThing)`` is called and not
|
|
``TestStateFullThing.setup_class()`` which would require you
|
|
to insert ``setup_class = classmethod(setup_class)`` to make
|
|
your setup function callable. Did we mention that lazyness
|
|
is a virtue?
|
|
|
|
Some ``py.test`` command-line options
|
|
=====================================
|
|
|
|
Regular options
|
|
---------------
|
|
|
|
``-v, --verbose``
|
|
Increase verbosity. This shows a test per line while running and also
|
|
shows the traceback after interrupting the test run with Ctrl-C.
|
|
|
|
|
|
``-x, --exitfirst``
|
|
exit instantly on the first error or the first failed test.
|
|
|
|
|
|
``-s, --nocapture``
|
|
disable catching of sys.stdout/stderr output.
|
|
|
|
|
|
``-k KEYWORD``
|
|
only run test items matching the given keyword expression. You can also add
|
|
use ``-k -KEYWORD`` to exlude tests from being run. The keyword is matched
|
|
against filename, test class name, method name.
|
|
|
|
|
|
``-l, --showlocals``
|
|
show locals in tracebacks: for every frame in the traceback, show the values
|
|
of the local variables.
|
|
|
|
|
|
``--pdb``
|
|
drop into pdb (the `Python debugger`_) on exceptions. If the debugger is
|
|
quitted, the next test is run. This implies ``-s``.
|
|
|
|
|
|
``--tb=TBSTYLE``
|
|
traceback verboseness: ``long`` is the default, ``short`` are the normal
|
|
Python tracebacks, ``no`` omits tracebacks completely.
|
|
|
|
|
|
``--fulltrace``
|
|
Don't cut any tracebacks. The default is to leave out frames if an infinite
|
|
recursion is detected.
|
|
|
|
|
|
``--nomagic``
|
|
Refrain from using magic as much as possible. This can be useful if you are
|
|
suspicious that ``py.test`` somehow interferes with your program in
|
|
unintended ways (if this is the case, please contact us!).
|
|
|
|
|
|
``--collectonly``
|
|
Only collect tests, don't execute them.
|
|
|
|
|
|
``--traceconfig``
|
|
trace considerations of conftest.py files. Useful when you have various
|
|
conftest.py files around and are unsure about their interaction.
|
|
|
|
``-f, --looponfailing``
|
|
Loop on failing test set. This is a feature you can use when you are trying
|
|
to fix a number of failing tests: First all the tests are being run. If a
|
|
number of tests are failing, these are run repeatedly afterwards. Every
|
|
repetition is started once a file below the directory that you started
|
|
testing for is changed. If one of the previously failing tests now passes,
|
|
it is removed from the test set.
|
|
|
|
``--exec=EXECUTABLE``
|
|
Python executable to run the tests with. Useful for testing on different
|
|
versions of Python.
|
|
|
|
|
|
|
|
experimental options
|
|
--------------------
|
|
|
|
**Note**: these options could change in the future.
|
|
|
|
|
|
``-d, --dist``
|
|
ad-hoc `distribute tests across machines`_ (requires conftest settings)
|
|
|
|
|
|
``-w, --startserver``
|
|
starts local web server for displaying test progress.
|
|
|
|
|
|
``-r, --runbrowser``
|
|
Run browser (implies --startserver).
|
|
|
|
|
|
``--boxed``
|
|
Use boxed tests: run each test in an external process. Very useful for testing
|
|
things that occasionally segfault (since normally the segfault then would
|
|
stop the whole test process).
|
|
|
|
``--rest``
|
|
`reStructured Text`_ output reporting.
|
|
|
|
|
|
.. _`reStructured Text`: http://docutils.sourceforge.net
|
|
.. _`Python debugger`: http://docs.python.org/lib/module-pdb.html
|
|
|
|
|
|
.. _`distribute tests across machines`:
|
|
|
|
|
|
Automated Distributed Testing
|
|
==================================
|
|
|
|
If you have a project with a large number of tests, and you have
|
|
machines accessible through SSH, ``py.test`` can distribute
|
|
tests across the machines. It does not require any particular
|
|
installation on the remote machine sides as it uses `py.execnet`_
|
|
mechanisms to distribute execution. Using distributed testing
|
|
can speed up your development process considerably and it
|
|
may also be useful where you need to use a remote server
|
|
that has more resources (e.g. RAM/diskspace) than your
|
|
local machine.
|
|
|
|
*WARNING*: support for distributed testing is experimental,
|
|
its mechanics and configuration options may change without
|
|
prior notice. Particularly, not all reporting features
|
|
of the in-process py.test have been integrated into
|
|
the distributed testing approach.
|
|
|
|
Requirements
|
|
------------
|
|
|
|
Local requirements:
|
|
|
|
* ssh client
|
|
* python
|
|
|
|
requirements for remote machines:
|
|
|
|
* ssh daemon running
|
|
* ssh keys setup to allow login without a password
|
|
* python (2.3 or 2.4 should work)
|
|
* unix like machine (reliance on ``os.fork``)
|
|
|
|
How to use it
|
|
-----------------------
|
|
|
|
When you issue ``py.test -d`` then your computer becomes
|
|
the distributor of tests ("master") and will start collecting
|
|
and distributing tests to several machines. The machines
|
|
need to be specified in a ``conftest.py`` file.
|
|
|
|
At start up, the master connects to each node using `py.execnet.SshGateway`_
|
|
and *rsyncs* all specified python packages to all nodes.
|
|
Then the master collects all of the tests and immediately sends test item
|
|
descriptions to its connected nodes. Each node has a local queue of tests
|
|
to run and begins to execute the tests, following the setup and teardown
|
|
semantics. The test are distributed at function and method level.
|
|
When a test run on a node is completed it reports back the result
|
|
to the master.
|
|
|
|
The master can run one of three reporters to process the events
|
|
from the testing nodes: command line, rest output and ajaxy web based.
|
|
|
|
.. _`py.execnet`: execnet.html
|
|
.. _`py.execnet.SshGateway`: execnet.html
|
|
|
|
Differences from local tests
|
|
----------------------------
|
|
|
|
* Test order is *not* guaranteed.
|
|
* Hanging nodes or tests are not detected properly.
|
|
* ``conftest.py`` cannot reference files outside of the copied packages.
|
|
|
|
Configuration
|
|
-------------
|
|
|
|
You must create a conftest.py in any parent directory above your tests.
|
|
|
|
The options that you need to specify in that conftest.py file are:
|
|
|
|
* `dist_hosts`: a required list of host specifications
|
|
* `dist_rsync_roots` - a list of relative locations to copy to the remote machines.
|
|
* `dist_rsync_ignore` - a list of relative locations to ignore for rsyncing
|
|
* `dist_remotepython` - the remote python executable to run.
|
|
* `dist_nicelevel` - process priority of remote nodes.
|
|
* `dist_boxed` - will run each single test in a separate process
|
|
(allowing to survive segfaults for example)
|
|
* `dist_taskspernode` - Maximum number of tasks being queued to remote nodes
|
|
|
|
Sample configuration::
|
|
|
|
dist_hosts = ['localhost', 'user@someserver:/tmp/somedir']
|
|
dist_rsync_roots = ['../pypy', '../py']
|
|
dist_remotepython = 'python2.4'
|
|
dist_nicelevel = 10
|
|
dist_boxed = False
|
|
dist_maxwait = 100
|
|
dist_taskspernode = 10
|
|
|
|
To use the browser-based reporter (with a nice AJAX interface) you have to tell
|
|
``py.test`` to run a small server locally using the ``-w`` or ``--startserver``
|
|
command line options. Afterwards you can point your browser to localhost:8000
|
|
to see the progress of the testing.
|
|
|
|
Development Notes
|
|
-----------------
|
|
|
|
Changing the behavior of the web based reporter requires `pypy`_ since the
|
|
javascript is actually generated fom rpython source.
|
|
|
|
There exists as well `L/Rsession document`_ which discusses in more details
|
|
unique features and developement notes.
|
|
|
|
.. _`pypy`: http://codespeak.net/pypy
|
|
.. _`L/Rsession document`: test-distributed.html
|
|
|
|
Future/Planned Features of py.test
|
|
==================================
|
|
|
|
Please note that the following descriptions of future features
|
|
sound factual although they aren't implemented yet. This
|
|
allows easy migration to real documentation later.
|
|
Nevertheless, none of the described planned features is
|
|
set in stone, yet. In fact, they are open to discussion on
|
|
py-dev at codespeak dot net.
|
|
|
|
Hey, if you want to suggest new features or command line options
|
|
for py.test it would be great if you could do it by providing
|
|
documentation for the feature. Welcome to documentation driven
|
|
development :-)
|
|
|
|
selecting tests by queries/full text search
|
|
-------------------------------------------
|
|
|
|
Note: there already is experimental support for test `selection by keyword`_.
|
|
Otherwise the following is not yet implemented
|
|
|
|
You can selectively run tests by specifiying words
|
|
on the command line in a google-like way. Example::
|
|
|
|
py.test simple
|
|
|
|
will run all tests that are found from the current directory
|
|
and that have the word "simple" somewhere in their `test address`_.
|
|
``test_simple1`` and ``TestSomething.test_whatever_simpleton`` would both
|
|
qualify. If you want to exclude the latter test method you could say::
|
|
|
|
py.test -- simple -simpleton
|
|
|
|
Note that the doubledash "--" signals the end of option parsing so
|
|
that "-simpleton" will not be misinterpreted as a command line option.
|
|
|
|
Interpreting positional arguments as specifying search queries
|
|
means that you can only restrict the set of tests. There is no way to
|
|
say "run all 'simple' in addition to all 'complex' tests". If this proves
|
|
to be a problem we can probably come up with a command line option
|
|
that allows to specify multiple queries which all add to the set of
|
|
tests-to-consider.
|
|
|
|
.. _`test address`:
|
|
|
|
the concept of a test address
|
|
-----------------------------
|
|
|
|
For specifiying tests it is convenient to define the notion
|
|
of a *test address*, representable as a filesystem path and a
|
|
list of names leading to a test item. If represented as a single
|
|
string the path and names are separated by a `/` character, for example:
|
|
|
|
``somedir/somepath.py/TestClass/test_method``
|
|
|
|
Such representations can be used to memoize failing tests
|
|
by writing them out in a file or communicating them across
|
|
process and computer boundaries.
|
|
|