Python socket recv如何高效接收数据？-杰瑞科技汇

Of course! The socket.recv() method is a fundamental part of network programming in Python. Here’s a comprehensive guide covering what it is, how it works, common pitfalls, and best practices.

What is `socket.recv()`?

recv() is a method on a socket object that receives data from the socket. It's the primary way your program reads information sent by a remote computer over a network.

When you call recv(), your program will block (pause and wait) until data is actually available to be read from the network buffer.

Basic Syntax and Return Value

data = socket.recv(bufsize)

socket: The socket object you created and are listening on or connected to.
bufsize: An integer representing the maximum amount of data (in bytes) to receive at one time. This is a crucial parameter.
- If the amount of incoming data is larger than bufsize, the excess data will remain in the operating system's network buffer and will be returned in the next call to recv().
- A common value is 4096 (4KB).
Return Value:
- On success, it returns the data as a bytes object.
- If the remote side has closed the connection gracefully, recv() will return an empty bytes object (b''). This is the standard way to know the connection is terminated.
- If an error occurs (e.g., the network connection is forcibly broken), it will raise an exception (like ConnectionResetError).

The Most Important Concept: Receiving Messages vs. Receiving a Stream

This is the most common point of confusion for beginners. TCP, which is what most socket communication uses, provides a byte stream, not a message stream.

What you send: You might send "Hello" and then "World".
What the other side receives: They might receive "He" in the first recv() call and "lloWorld" in the second.

The network layer doesn't preserve your message boundaries. Therefore, you cannot rely on a single recv() call to get your entire message. You need a protocol to handle this.

Common Receiving Patterns

Here are the three most common patterns for handling data with recv().

Pattern 1: Receiving a Fixed-Size Message

If you know the exact size of the message you're expecting, you can simply loop until you've received all of it.

# Assuming 'conn' is your connected socket
# and you expect to receive a 16-byte message
expected_size = 16
data = b''
while len(data) < expected_size:
    packet = conn.recv(expected_size - len(data))
    if not packet: # Connection closed by client
        print("Client disconnected prematurely.")
        break
    data += packet
print(f"Received full message: {data}")

Pattern 2: Receiving Until a Delimiter (The "Readline" Pattern)

This is very common for text-based protocols (like HTTP or SMTP). You define a special character (or sequence) that marks the end of a message, such as a newline character (\n).

# Assuming 'conn' is your connected socket
buffer = b''
while True:
    chunk = conn.recv(4096) # Receive a chunk
    if not chunk: # Connection closed
        break
    buffer += chunk
    # Check if our delimiter is in the buffer
    if b'\n' in buffer:
        # Split the buffer into the message and the rest
        message, buffer = buffer.split(b'\n', 1)
        print(f"Received message: {message.decode('utf-8')}")
        # You can process the message here
        # If you expect only one message, you would break here
        # break

Pattern 3: The Length-Prefixed Message (Most Robust)

This is the most common and robust method for sending arbitrary data (including binary data). Before sending the actual message, you send the length of the message as a fixed-size integer.

Client-Side Sending Logic:

message = "This is my important message"
# 1. Get the length of the message and encode it
length_bytes = len(message.encode('utf-8')).to_bytes(4, 'big') # 4-byte length
# 2. Send the length first
conn.sendall(length_bytes)
# 3. Then send the actual message
conn.sendall(message.encode('utf-8'))

Server-Side Receiving Logic (The Pattern):

# Assuming 'conn' is your connected socket
# 1. First, receive the 4-byte length prefix
length_prefix = conn.recv(4)
if not length_prefix:
    print("Client disconnected.")
    # Handle disconnection
# 2. Unpack the length from the bytes
message_length = int.from_bytes(length_prefix, 'big')
# 3. Now, receive the full message based on its length
data_received = 0
full_message = b''
while data_received < message_length:
    # Receive the remaining part of the message
    chunk = conn.recv(message_length - data_received)
    if not chunk:
        print("Connection broken while receiving message.")
        break
    full_message += chunk
    data_received += len(chunk)
# 4. You now have the complete message
print(f"Received full message: {full_message.decode('utf-8')}")

A Complete, Practical Example (Echo Server)

This example demonstrates a simple server that accepts a connection, receives data from the client using the length-prefixed pattern, prints it, and sends it back.

server.py

import socket
HOST = '127.0.0.1'  # Standard loopback interface address (localhost)
PORT = 65432        # Port to listen on (non-privileged ports are > 1023)
with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
    s.bind((HOST, PORT))
    s.listen()
    print(f"Server listening on {HOST}:{PORT}")
    conn, addr = s.accept()
    with conn:
        print(f"Connected by {addr}")
        while True:
            # 1. Receive the 4-byte length prefix
            length_prefix = conn.recv(4)
            if not length_prefix:
                print("Client disconnected.")
                break
            # 2. Unpack the length
            message_length = int.from_bytes(length_prefix, 'big')
            print(f"Expecting a message of {message_length} bytes.")
            # 3. Receive the full message
            data_received = 0
            full_message = b''
            while data_received < message_length:
                chunk = conn.recv(message_length - data_received)
                if not chunk:
                    print("Connection broken.")
                    break
                full_message += chunk
                data_received += len(chunk)
            if data_received < message_length:
                break # Connection was broken
            # 4. Process the message
            print(f"Received: {full_message.decode('utf-8')}")
            # 5. Echo the message back to the client
            conn.sendall(full_message)

client.py

import socket
HOST = '127.0.0.1'  # The server's hostname or IP address
PORT = 65432        # The port used by the server
messages = [
    "Hello, server!",
    "This is a longer message to test the length-prefixed protocol.",
    "Short"
]
with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
    s.connect((HOST, PORT))
    for msg in messages:
        # 1. Encode the message and get its length
        message_bytes = msg.encode('utf-8')
        length_bytes = len(message_bytes).to_bytes(4, 'big')
        # 2. Send length prefix and then the message
        s.sendall(length_bytes)
        s.sendall(message_bytes)
        # 3. Receive the echoed message back from the server
        # We use the same length-prefixed logic to read it
        echoed_length_prefix = s.recv(4)
        echoed_length = int.from_bytes(echoed_length_prefix, 'big')
        echoed_data_received = 0
        echoed_full_message = b''
        while echoed_data_received < echoed_length:
            chunk = s.recv(echoed_length - echoed_data_received)
            echoed_full_message += chunk
            echoed_data_received += len(chunk)
        print(f"Server echoed: {echoed_full_message.decode('utf-8')}")
print("Client finished.")

Key Takeaways & Best Practices

recv() returns bytes, not strings. You must decode the bytes object into a string (e.g., using .decode('utf-8')) if you are working with text.
TCP is a stream, not a message protocol. Never assume one recv() call equals one complete message.
Handle the empty bytes return value. Always check if not data: after a recv() call to detect a clean disconnection from the remote host.
Use a length-prefixed protocol. For most applications, sending the length of the data before the data itself is the most reliable way to handle message boundaries.
recv(0) is a special case. It can be used to "peek" at the incoming data without removing it from the buffer. It will return up to bufsize bytes of data, but that data will still be the first thing returned by the next recv() call. This is useful for checking for a delimiter without consuming it.

Python socket recv如何高效接收数据？

What is `socket.recv()`?

Basic Syntax and Return Value

The Most Important Concept: Receiving Messages vs. Receiving a Stream

Common Receiving Patterns

Pattern 1: Receiving a Fixed-Size Message

Pattern 2: Receiving Until a Delimiter (The "Readline" Pattern)

Pattern 3: The Length-Prefixed Message (Most Robust)

A Complete, Practical Example (Echo Server)

Key Takeaways & Best Practices

99ANYc3cd6

佳能100D怎么用？新手必看教程指南。

QtCreator安装教程，如何一步步完成安装？

Photoshop官方教程怎么学？新手入门看哪些？

Java Swing图书管理如何实现核心功能？

Java如何实现Telnet服务器？

Java Socket编程实例具体怎么实现？

Python egg info 模块如何创建与管理？

Premiere CS6教程怎么学？新手入门指南

FastDFS Python API如何使用？

Java如何调用WebService？

Java SDK 64位下载地址哪里找？

Java中Socket与HTTP有何区别？

MySQL与Java数据类型如何对应转换？

pkpm钢结构设计视频教程

Java中double二进制如何表示与计算？

python django开发web

Python socket recv如何高效接收数据？

What is socket.recv()?

Basic Syntax and Return Value

The Most Important Concept: Receiving Messages vs. Receiving a Stream

Common Receiving Patterns

Pattern 1: Receiving a Fixed-Size Message

Pattern 2: Receiving Until a Delimiter (The "Readline" Pattern)

Pattern 3: The Length-Prefixed Message (Most Robust)

A Complete, Practical Example (Echo Server)

Key Takeaways & Best Practices

相关推荐

Java Socket编程实例具体怎么实现？

What is `socket.recv()`?