Python中str与bytes互相转换

最新推荐文章于 2025-06-26 21:55:37 发布

there2belief

最新推荐文章于 2025-06-26 21:55:37 发布

阅读量9.3k

点赞数 2

CC 4.0 BY-SA版权

分类专栏： AI/ML/DL 泛coding

本文链接：https://blog.youkuaiyun.com/dou3516/article/details/87440879

AI/ML/DL 同时被 2 个专栏收录

254 篇文章

订阅专栏

泛coding

59 篇文章

订阅专栏

本文详细介绍了在Python中如何将字符串快速转换为字节，以及如何将字节转换回字符串的方法。通过使用str.encode()和bytes.decode()函数，可以轻松实现数据类型的转换。此外，还探讨了使用bytearray构造函数的多种方式，包括初始化不同来源的数据。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

快速转换方式

# str to bytes
my_str = "hello world"
my_str_as_bytes = str.encode(my_str)
type(my_str_as_bytes) # ensure it is byte representation
# bytes to str
my_decoded_str = my_str_as_bytes.decode()
type(my_decoded_str) # ensure it is string representation

另一种str to bytes方式：bytearray([source[, encoding[, errors]]])

If you look at the docs for bytes, it points you to bytearray:

bytearray([source[, encoding[, errors]]])

Return a new array of bytes. The bytearray type is a mutable sequence of integers in the range 0 <= x < 256. It has most of the usual methods of mutable sequences, described in Mutable Sequence Types, as well as most methods that the bytes type has, see Bytes and Byte Array Methods.

The optional source parameter can be used to initialize the array in a few different ways:

If it is a string, you must also give the encoding (and optionally, errors) parameters; bytearray() then converts the string to bytes using str.encode().

If it is an integer, the array will have that size and will be initialized with null bytes.

If it is an object conforming to the buffer interface, a read-only buffer of the object will be used to initialize the bytes array.

If it is an iterable, it must be an iterable of integers in the range 0 <= x < 256, which are used as the initial contents of the array.

Without an argument, an array of size 0 is created.

So bytes can do much more than just encode a string. It's Pythonic that it would allow you to call the constructor with any type of source parameter that makes sense.

For encoding a string, I think that some_string.encode(encoding) is more Pythonic than using the constructor, because it is the most self documenting -- "take this string and encode it with this encoding" is clearer than bytes(some_string, encoding) -- there is no explicit verb when you use the constructor.

Edit: I checked the Python source. If you pass a unicode string to bytes using CPython, it calls PyUnicode_AsEncodedString, which is the implementation of encode; so you're just skipping a level of indirection if you call encode yourself.

Also, see Serdalis' comment -- unicode_string.encode(encoding) is also more Pythonic because its inverse is byte_string.decode(encoding) and symmetry is nice.

From: https://stackoverflow.com/questions/7585435/best-way-to-convert-string-to-bytes-in-python-3