pycos: Concurrent, Asynchronous, Distributed, Communicating Tasks with Python¶
Sourceforge.net has been unreliable the past few weeks, so this documentation is now available at github as well.
pycos is renamed from asyncoro project to better reflect its functionality and to avoid confusion with asyncore module. See asyncoro2pycos for details.
pycos is a Python framework for concurrent, asynchronous, network, distributed programming and distributed computing, using generator functions, asynchronous completions and message passing. pycos can be used to create tasks with generator functions, similar to the way threads are created with functions using Python’s threading module. Programs developed with pycos have same logic and structure as programs with threads, except for a few syntactic changes - mostly using yield with asynchronous completions that give control to pycos’s scheduler, which interleaves executions of generators, similar to the way an operating system executes multiple processes.
Unlike threads, creating tasks with pycos is very efficient (see below). Moreover, with pycos context switch occurs only when tasks use yield (typically with an asychronous call), so there is no need for locking and there is no overhead of unnecessary context switches.
Unlike with other asynchronous frameworks, programming with pycos is rather straight forward; Asynchronous Concurrenct Programming (pycos) describes 4 simple steps to convert programming with threads to programming with pycos.
pycos features include:
- No callbacks or event loops! No need to lock critical sections either,
- Efficient polling mechanisms epoll, kqueue, Windows I/O Completion Ports (IOCP) for high performance and scalability,
- Asynchronous (non-blocking) sockets and pipes, for concurrent processing of I/O,
- SSL for security,
- Asynchronous locking primitives similar to Python threading module,
- Asynchronous timers and timeouts,
- Message passing for (local and remote) tasks to exchange messages one-to-one with Message Queue Pattern or through broadcasting channels with Publish-Subscribe Pattern,
- Location transparency with naming and locating (local and remote) resources,
- Monitoring and restarting of (local or remote) tasks, for fault detection and fault-tolerance,
- Distributing computation components (code and data) to execute Distributed Communicating Tasks (dispycos), for wide range of use cases, covering SIMD, MISD, MIMD system architectures at the process level, and web interface to monitor cluster/application status/performance; in-memory processing, data streaming, real-time (live) analytics and Cloud Computing are supported as well,
- Remote execution of tasks for distributed programming with Remote Task Invocation RTI (Remote Task Invocation) and message passing,
- Hot-swapping of task functions, for dynamic system reconfiguration,
- Thread pools with asynchronous task completions, for executing synchronous tasks, e.g., external library calls, such as reading standard input.
import time import pycos def task_proc(task=None): yield task.suspend() tasks = [pycos.Task(task_proc) for i in xrange(10000)] time.sleep(5) # wait for tasks to run (and suspend) for task in tasks: task.resume() for line in open('/proc/self/status'): if line.startswith('VmPeak'): print('VM Peak: %d MB' % (int(line.split()) / 1024)) elif line.startswith('VmHWM'): print('VM Max: %d MB' % (int(line.split()) / 1024)) elif line.startswith('VmStk'): print('VM Stack: %d MB' % (int(line.split()) / 1024))
shows that running 10,000 tasks takes about 23 MB of memory (with about 10 MB taken by modules used in pycos). Memory and time scale linearly: For 100,000 tasks memory used is 116 MB, for 200,000 tasks it is 218 MB, for 300,000 tasks it is 315 MB.
pycos works with Python 2.7+ and Python 3.1+ and tested on Linux, Mac OS X and Windows; it may work on other platforms too. pycos works with PyPy as well.
pycos is implemented with standard modules in Python.
Node / Servers sends node availability status (availability of CPU as percent, memory in bytes and disk space in bytes) at pulse_interval frequency if psutil module is installed. This information may be useful to clients, for example, to analyze application performance, to filter nodes based on available resources etc.
Under Windows efficient polling notifier I/O Completion Ports is supported if pywin32 is available; otherwise, inefficient ‘select’ notifier is used.
For IPv6, netifaces module is required with OS X. netifaces module is strongly recommended with Linux and Windows as well (for both IPv4 and IPv6). For IPv6 with Windows, win_inet_pton module is required with Python 2.7 (but not required with Python 3).
pycos package is available in Python Package Index (PyPI) so it can be installed with:
python -m pip install pycos
pycos can also be downloaded from Sourceforge Files. Starting with 4.6.6 release, the download file is PyPI package so it can be installed with:
python -m pip install pycos-<version>.tar.gz
Examples illustrating some of the features of pycos are installed in ‘examples’ directory under where pycos module is installed, which can be obtained with the program:
import os, pycos print(os.path.join(os.path.dirname(pycos.__file__), 'examples'))
README file in that directory for brief
description of each of the files.
If you use pycos in academic/research work, please cite pycos as:
Giridhar Pemmasani, "pycos: Concurrent, Asynchronous, Distributed, Communicating Tasks with Python", http://pycos.sourceforge.io, 2017.
- 1. Introduction
- 2. Asynchronous Concurrenct Programming (pycos)
- 3. Asynchronous Network Programming (pycos)
- 4. Asynchronous Files and Pipes (asyncfile)
- 5. Distributed Programming (netpycos)
- 6. Distributed Communicating Tasks (dispycos)
- 7. Tutorial / Examples
- 8. asyncoro2pycos
- 9. Contribute to / Recommend / Share pycos