Quick Python Performance Optimization
This is Part I of a two-part series of blog posts on performance optimization in Python.
The aim is to explain the right way to do the simple things we use in day-to-day Python programming that have a very relevant performance impact.
1. %timeit (per line) and %prun (cProfile) in the IPython interactive shell.
Profile your code while working on it, and try to find where the bottleneck is. This does not contradict the fact that premature optimization is the root of all evil; it is meant as a first level of optimization, not a heavy optimization pass.
For more on profiling python code, you should read this: http://www.huyng.com/posts/python-performance-analysis/
Another interesting library, line_profiler, does line-by-line profiling: https://bitbucket.org/robertkern/line_profiler
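For example, a minimal IPython session might look like this (slow_function is a hypothetical stand-in for your own code, and the timing shown is illustrative):

    In [1]: %timeit sum(i * i for i in xrange(1000))
    10000 loops, best of 3: 85.4 us per loop

    In [2]: %prun slow_function()   # prints a cProfile report of where the time went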
2. Reduce the number of function calls.
If a list needs to be manipulated, pass the entire list to the function once, rather than iterating over the list and calling the function for each element.
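A sketch of the idea:

    data = range(1000)

    # Slow: one Python function call per element
    def double(x):
        return x * 2

    result = [double(item) for item in data]

    # Faster: a single call that processes the whole list
    def double_all(items):
        return [x * 2 for x in items]

    result = double_all(data)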
3. Use xrange instead of range.
xrange is a C implementation of range with an eye on efficient memory usage: it produces numbers lazily instead of building the whole list up front.
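For example (Python 2; in Python 3, range behaves this way by default):

    total = 0
    for i in xrange(10 ** 7):   # no 10-million-element list is ever built
        total += i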
4. For huge amounts of data, use numpy; it performs better than the standard data structures.
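A small sketch (assuming numpy is installed):

    import numpy as np

    data = np.arange(10 ** 6)   # one contiguous C array, not a list of Python objects
    total = data.sum()          # the loop runs in C, far faster than sum() over a list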
5. "".join(string) is better than + or +=
6. while 1 is faster than while True
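This is because in Python 2, True is an ordinary name that gets looked up on every iteration, while 1 is a compile-time constant. A tiny sketch:

    i = 0
    while 1:              # constant test; 'while True' re-resolves the name True
        i += 1
        if i == 10 ** 6:
            break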
7. list comprehension > for loop > while loop
A list comprehension is faster than an explicit for loop over the list, and a while loop with an external counter is the slowest.
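For example, three ways to build the same list, slowest to fastest:

    # Slowest: while loop with an external counter
    squares = []
    i = 0
    while i < 1000:
        squares.append(i * i)
        i += 1

    # Faster: for loop
    squares = []
    for i in xrange(1000):
        squares.append(i * i)

    # Fastest: list comprehension
    squares = [i * i for i in xrange(1000)]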
8. Use cProfile, cStringIO and cPickle.
Always use the C versions of modules where available.
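A common idiom is to fall back to the pure-Python version when the C one is unavailable:

    try:
        import cPickle as pickle      # C implementation
    except ImportError:
        import pickle                 # pure-Python fallback

    try:
        from cStringIO import StringIO
    except ImportError:
        from StringIO import StringIO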
9. Use local variables.
Local variables are faster than globals, builtins, or attribute lookups, so bind frequently used names to locals inside hot functions.
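For example, hoisting an attribute lookup out of a hot loop:

    import math

    def root_all(values):
        sqrt = math.sqrt                  # one attribute lookup, bound to a fast local
        return [sqrt(v) for v in values]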
10. Where both list and iterator versions exist, prefer iterators - they are memory efficient and scalable. Use itertools.
Create generators and use yield as much as possible; they are faster and leaner than building up the whole list. A small sketch follows the links below.
http://www.diveinto.org/python3/iterators.html
http://stackoverflow.com/questions/231767/the-python-yield-keyword-explained
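For instance, a generator that processes a file without ever holding it in memory (numbers.txt is a hypothetical file with one integer per line):

    def read_numbers(path):
        # yields one value at a time instead of returning a full list
        with open(path) as f:
            for line in f:
                yield int(line)

    total = sum(read_numbers("numbers.txt"))  # hypothetical file; constant memory use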
Let's continue to Part II for the next level of quick optimization tricks.
This is Part II of Quick Python Performance Optimization.
11. Use map, reduce and filter instead of explicit for loops wherever possible.
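For example (reduce is a builtin in Python 2; in Python 3 it moved to functools):

    nums = range(20)
    evens = filter(lambda x: x % 2 == 0, nums)    # instead of a for loop with an if
    doubled = map(lambda x: x * 2, nums)
    total = reduce(lambda a, b: a + b, nums, 0)   # fold the list into one value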
12. For checking 'a in b', a dict or set is better than a list or tuple.
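For example:

    words = ["apple", "banana", "cherry"] * 1000
    word_set = set(words)        # build once

    "banana" in words            # O(n): scans the list
    "banana" in word_set         # O(1) on average: a hash lookup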
13. While working with big amounts of data, use immutable datatypes if possible; they are faster: tuples > lists.
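You can see one facet of this in IPython: a tuple literal is compiled to a single constant, while a list literal is rebuilt on every execution (try it yourself):

    In [1]: %timeit (1, 2, 3, 4, 5)
    In [2]: %timeit [1, 2, 3, 4, 5]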
14. Insertion into a list is O(n); appending at the end is amortized O(1).
15. If you need manipulations on both ends of a list, use collections.deque.
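For example:

    from collections import deque

    d = deque([1, 2, 3])
    d.appendleft(0)   # O(1); list.insert(0, x) would be O(n)
    d.append(4)       # O(1)
    d.popleft()       # O(1); list.pop(0) would be O(n)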
16. del - delete objects after use.
Python does it by itself, but you can make sure of it:
- with the gc module, or
- by writing an __del__ magic method, or
- the simplest way, by calling del after use.
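For example:

    import gc

    big = range(10 ** 7)
    total = sum(big)
    del big          # drop the reference as soon as the object is no longer needed
    gc.collect()     # optionally force a collection pass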
17. time.clock() - prefer it over time.time() for benchmarking: it measures CPU time on Unix and a high-resolution wall clock on Windows (Python 2; later versions prefer time.perf_counter()).
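A sketch:

    import time

    start = time.clock()
    total = sum(xrange(10 ** 6))     # the code being timed
    elapsed = time.clock() - start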
18. GIL (http://wiki.python.org/moin/GlobalInterpreterLock) - the GIL is a demon.
The GIL allows only one native thread per process to execute Python bytecode at a time, preventing CPU-level parallelism. Try using ctypes and native C libraries to overcome this. Whenever you reach the limit of what optimizing in pure Python can do, there is always the option of rewriting terribly slow functions in native C and calling them through Python's C bindings. Other libraries like gevent also attack the problem, with some success.
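A minimal sketch of the ctypes route (libfast.so and heavy_compute are hypothetical names; you would build the library from your own C code, e.g. gcc -shared -fPIC -O2 fast.c -o libfast.so):

    import ctypes

    lib = ctypes.CDLL("./libfast.so")                 # hypothetical shared library
    lib.heavy_compute.argtypes = [ctypes.c_int]       # hypothetical C function
    lib.heavy_compute.restype = ctypes.c_long

    # ctypes releases the GIL for the duration of the foreign call,
    # so other Python threads can run while the C code executes.
    result = lib.heavy_compute(1000000)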
TL;DR: While you write code, give one round of thought to the data structures, the iteration constructs, and the builtins, and create C extensions to work around the GIL if needed.
From
http://infiniteloop.in/blog/quick-python-performance-optimization-part-i/
http://infiniteloop.in/blog/quick-python-performance-optimization-part-ii/