Rust 2026 Field Notes - Async Streams and Iteration

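In async programming we need more than "wait for one value" (a Future); we also need "keep receiving a series of values", and that is what a Stream is. A Stream is the async counterpart of Iterator, but its design, its use, and its backpressure control are considerably more involved. This article starts from the Stream trait itself and covers async generators, the async-stream crate, backpressure patterns, and the essential differences between Stream and Iterator.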

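The Stream trait: the foundation of async iteration

Definition and core method

Stream is defined in futures-core (not in the standard library) and has a single required method:

// The Stream trait as defined in futures-core
use std::pin::Pin;
use std::task::{Context, Poll};

pub trait Stream {
    type Item;

    fn poll_next(
        self: Pin<&mut Self>,
        cx: &mut Context<'_>,
    ) -> Poll<Option<Self::Item>>;
}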

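Where Iterator::next() returns Option<Item>, Stream::poll_next returns Poll<Option<Item>>:

Poll::Pending: no value is ready yet; the task is woken when one may be available
Poll::Ready(Some(item)): the stream produced a value
Poll::Ready(None): the stream has ended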
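The three return values can be seen in a dependency-free sketch. It redeclares the Stream trait locally (mirroring the futures-core definition) so that it compiles with std alone; Counter, next, block_on, and collect_counter are hypothetical names introduced for illustration, and the tiny executor stands in for a real runtime.

```rust
use std::future::{poll_fn, Future};
use std::pin::{pin, Pin};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread::{self, Thread};

// Local mirror of futures-core's Stream trait, redeclared so the sketch has no dependencies.
pub trait Stream {
    type Item;
    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>>;
}

// Hypothetical stream: yields 0, 1, ..., limit - 1 and is always ready.
pub struct Counter {
    pub next: u32,
    pub limit: u32,
}

impl Stream for Counter {
    type Item = u32;
    fn poll_next(mut self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<Option<u32>> {
        if self.next < self.limit {
            let value = self.next;
            self.next += 1;
            Poll::Ready(Some(value)) // an item
        } else {
            Poll::Ready(None) // end of stream
        }
    }
}

// Hypothetical helper: turns one poll_next call into something that can be awaited.
pub async fn next<S: Stream + Unpin>(stream: &mut S) -> Option<S::Item> {
    poll_fn(|cx| Pin::new(&mut *stream).poll_next(cx)).await
}

// Unparks the blocked thread when the future signals progress.
struct ThreadWaker(Thread);

impl Wake for ThreadWaker {
    fn wake(self: Arc<Self>) {
        self.0.unpark();
    }
}

// Minimal single-future executor: poll, and park the thread on Pending.
pub fn block_on<F: Future>(fut: F) -> F::Output {
    let mut fut = pin!(fut);
    let waker = Waker::from(Arc::new(ThreadWaker(thread::current())));
    let mut cx = Context::from_waker(&waker);
    loop {
        match fut.as_mut().poll(&mut cx) {
            Poll::Ready(value) => return value,
            Poll::Pending => thread::park(),
        }
    }
}

// Drains a Counter the same way `while let Some(x) = s.next().await` would.
pub fn collect_counter(limit: u32) -> Vec<u32> {
    block_on(async {
        let mut s = Counter { next: 0, limit };
        let mut out = Vec::new();
        while let Some(v) = next(&mut s).await {
            out.push(v);
        }
        out
    })
}
```

Calling collect_counter(3) drains the stream until poll_next reports Poll::Ready(None). In real code the futures crate's StreamExt::next plays the role of the next helper here.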

Useful Streams in futures-util

futures-util provides a rich set of Stream adapters that read much like Iterator's:

use futures::stream::{self, StreamExt};

#[tokio::main]
async fn main() {
    // Build a Stream from an iterator, double each item, keep multiples of 4
    let mut s = stream::iter(vec![1, 2, 3, 4, 5])
        .map(|x| x * 2)
        .filter(|x| futures::future::ready(x % 4 == 0));

    // Prints 4 and 8
    while let Some(item) = s.next().await {
        println!("{item}");
    }
}

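Common adapters at a glance:

Adapter	Purpose	Backpressure behavior
`.map()`	Transforms each item	Passes through
`.filter()`	Filters items	Passes through
`.buffered(n)`	Runs up to n Futures concurrently	Bounded concurrency
`.throttle(duration)`	Rate-limits output (provided by tokio-stream's StreamExt, not futures-util)	Delays items; does not drop them
`.chunks(n)`	Groups items into batches	Passes through
`.take(n)`	Takes the first n items	Ends the stream early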

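Pitfall: StreamExt::collect() gathers the whole Stream into a collection such as a Vec. On an infinite Stream it never completes, and memory keeps growing while it waits. Make sure the Stream is finite before calling collect.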

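Patterns for returning a Stream from a function

The problem: an async fn cannot itself be a Stream

An async fn always returns impl Future<Output = T>: it produces a single value, and stable Rust has no yield inside it. You can declare async fn f() -> impl Stream, but that is a Future that resolves to a Stream, not a function whose body generates the items. To produce a Stream you either implement the trait by hand or use a crate.

Pattern 1: implementing Stream by hand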

use futures::stream::Stream;
use std::pin::Pin;
use std::task::{Context, Poll};

// Yields count, count + 1, ..., limit - 1
struct CounterStream {
    count: u32,
    limit: u32,
}

impl Stream for CounterStream {
    type Item = u32;

    fn poll_next(
        mut self: Pin<&mut Self>,
        _cx: &mut Context<'_>,
    ) -> Poll<Option<Self::Item>> {
        // Always ready, so the waker in _cx is never needed
        if self.count < self.limit {
            let val = self.count;
            self.count += 1;
            Poll::Ready(Some(val))
        } else {
            Poll::Ready(None)
        }
    }
}

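The cost of doing it by hand: you have to get Pin, Context, and Poll right together. For anything non-trivial (I/O, timers), a handwritten poll_next is tedious and easy to get wrong.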
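One rule that is easy to miss concerns Poll::Pending: before returning it, poll_next must arrange for the waker in the Context to be called, otherwise the executor never polls again. The std-only sketch below illustrates this with a hypothetical Stutter stream that is "not ready" once before every item. The Stream trait is redeclared locally to mirror futures-core, and block_on is a minimal stand-in for a real runtime.

```rust
use std::future::{poll_fn, Future};
use std::pin::{pin, Pin};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread::{self, Thread};

// Local mirror of futures-core's Stream trait, redeclared so the sketch has no dependencies.
pub trait Stream {
    type Item;
    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>>;
}

// Hypothetical stream that returns Pending once before each item.
pub struct Stutter {
    pub remaining: u32,
    pub ready: bool,
    pub pendings: u32,
}

impl Stream for Stutter {
    type Item = u32;
    fn poll_next(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<u32>> {
        if self.remaining == 0 {
            return Poll::Ready(None);
        }
        if !self.ready {
            self.ready = true;
            self.pendings += 1;
            // Contract: schedule a wake-up before returning Pending.
            // Without this line the executor below would stay parked forever.
            cx.waker().wake_by_ref();
            return Poll::Pending;
        }
        self.ready = false;
        self.remaining -= 1;
        Poll::Ready(Some(self.remaining))
    }
}

// Unparks the blocked thread when wake() is called.
struct ThreadWaker(Thread);

impl Wake for ThreadWaker {
    fn wake(self: Arc<Self>) {
        self.0.unpark();
    }
}

// Minimal single-future executor: poll, and park the thread on Pending.
pub fn block_on<F: Future>(fut: F) -> F::Output {
    let mut fut = pin!(fut);
    let waker = Waker::from(Arc::new(ThreadWaker(thread::current())));
    let mut cx = Context::from_waker(&waker);
    loop {
        match fut.as_mut().poll(&mut cx) {
            Poll::Ready(value) => return value,
            Poll::Pending => thread::park(),
        }
    }
}

// Drains the stream and reports the items plus how often Pending was returned.
pub fn drain_stutter(n: u32) -> (Vec<u32>, u32) {
    block_on(async {
        let mut s = Stutter { remaining: n, ready: false, pendings: 0 };
        let mut items = Vec::new();
        while let Some(v) = poll_fn(|cx| Pin::new(&mut s).poll_next(cx)).await {
            items.push(v);
        }
        (items, s.pendings)
    })
}
```

A real I/O stream would not wake itself immediately; it would hand the waker to the reactor or timer that knows when data arrives. The self-wake here only keeps the sketch self-contained.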

Pattern 2: the async-stream crate (recommended)

async-stream lets you write the generating logic of a Stream in async syntax:

use async_stream::stream;
use futures::stream::StreamExt;
use tokio::time::{sleep, Duration};

fn countdown(from: u32) -> impl futures::Stream<Item = u32> {
    stream! {
        // Yields from - 1 down to 0, one item every 100 ms
        for i in (0..from).rev() {
            sleep(Duration::from_millis(100)).await;
            yield i;
        }
    }
}

#[tokio::main]
async fn main() {
    let s = countdown(5);
    // The stream built by stream! is not Unpin, so pin it before calling next()
    futures::pin_mut!(s);
    while let Some(val) = s.next().await {
        println!("countdown: {val}");
    }
}

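The stream! macro wraps the body in an async block and rewrites each yield into a hand-off of the value to the caller, so the compiler still generates the state machine for you. The overhead relative to a handwritten poll_next is small in most workloads, though not zero. Its try_stream! variant supports error propagation with ?: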

use async_stream::try_stream;

fn lines_from_file(path: &str) -> impl futures::Stream<Item = Result<String, std::io::Error>> {
    // Own the path so the returned stream does not borrow from the caller
    let path = path.to_owned();
    try_stream! {
        // On failure, `?` makes the stream yield the Err and then end
        let content = tokio::fs::read_to_string(&path).await?;
        for line in content.lines() {
            yield line.to_owned();
        }
    }
}

Pattern 3: turning a channel into a Stream

When the data is pushed from outside (a WebSocket, a message queue), wrapping a channel as a Stream is the most practical pattern:

use tokio::sync::mpsc;
use tokio_stream::wrappers::ReceiverStream;

// Must be called from within a Tokio runtime, because it spawns a task
fn websocket_messages() -> impl futures::Stream<Item = String> {
    let (tx, rx) = mpsc::channel(128);

    // Simulate an external source pushing messages
    tokio::spawn(async move {
        for i in 0..10 {
            // send() fails once the receiving side has been dropped
            if tx.send(format!("msg-{i}")).await.is_err() {
                break;
            }
        }
    });

    // ReceiverStream wraps mpsc::Receiver so that it implements Stream
    ReceiverStream::new(rx)
}

Tokio's mpsc::Receiver does not implement Stream itself; wrap it in ReceiverStream from the tokio-stream crate. The pattern gives you backpressure for free: when the channel is full, the sender's send().await waits.

async generators: gen blocks (nightly only)

The Rust project has been working on gen blocks, which bring generator syntax closer to Python's yield:

// nightly-only and unstable; for illustration only (gen is a keyword from the 2024 edition on)
#![feature(gen_blocks)]

fn counter() -> impl Iterator<Item = u32> {
    gen {
        let mut i = 0;
        loop {
            yield i;
            i += 1;
        }
    }
}

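The async version (async gen blocks are likewise nightly only):

// nightly-only and highly experimental; for illustration only
#![feature(gen_blocks, async_iterator)]

use std::async_iter::AsyncIterator;

// On nightly, an async gen block evaluates to std's AsyncIterator, not futures::Stream
fn async_counter() -> impl AsyncIterator<Item = u32> {
    async gen {
        let mut i = 0;
        loop {
            tokio::time::sleep(std::time::Duration::from_secs(1)).await;
            yield i;
            i += 1;
        }
    }
}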

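As of Rust 1.96, gen blocks and async gen blocks are still nightly only and their API may change. In production, use the async-stream crate rather than depending on nightly gen blocks.

The core value of gen blocks: the compiler generates the state machine, which removes the boilerplate of a handwritten poll_next. Once stabilized, they are likely to replace the async-stream crate as the preferred option.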

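Backpressure patterns

Backpressure is a central concern in any Stream pipeline: if the consumer is slower than the producer and something between them buffers without a bound (an unbounded channel, one spawned task per item), data piles up without limit. A Stream on its own is pull-based, so the risk sits at those buffering points. Rust's async ecosystem offers several ways to bound them.

Pattern 1: bounded concurrency (buffered / buffer_unordered)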

use futures::stream::{self, StreamExt};

async fn fetch_url(url: &str) -> String {
    // Simulate an HTTP request
    tokio::time::sleep(std::time::Duration::from_millis(100)).await;
    format!("response from {url}")
}

async fn bounded_concurrency() {
    let urls = vec!["a", "b", "c", "d", "e", "f", "g", "h"];

    // At most 3 requests in flight at once
    let results: Vec<String> = stream::iter(urls)
        .map(|url| fetch_url(url))
        .buffered(3) // the key line: caps concurrency
        .collect()
        .await;

    println!("fetched {} results", results.len());
}

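buffered(n) guarantees that at most n Futures are in flight at once. There are two variants:

buffered(n): yields results in input order
buffer_unordered(n): yields results in completion order, so one slow Future does not hold back the rest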

Pattern 2: channel backpressure

use tokio::sync::mpsc;

async fn producer(tx: mpsc::Sender<u32>) {
    for i in 0..1000 {
        // send() waits while the channel is full: built-in backpressure
        if tx.send(i).await.is_err() {
            break; // the consumer has closed
        }
    }
}

async fn consumer(mut rx: mpsc::Receiver<u32>) {
    while let Some(val) = rx.recv().await {
        // Slow consumer: simulate processing time
        tokio::time::sleep(std::time::Duration::from_millis(50)).await;
        println!("processed: {val}");
    }
}

#[tokio::main]
async fn main() {
    // Capacity 16: the producer can run at most 16 messages ahead of the consumer
    let (tx, rx) = mpsc::channel(16);

    let p = tokio::spawn(producer(tx));
    let c = tokio::spawn(consumer(rx));

    let _ = p.await;
    let _ = c.await;
}

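Rules of thumb for channel capacity:

Capacity 1: strictest backpressure, close to a synchronous hand-off
Capacity 16 to 128: a general-purpose default that balances throughput and latency
Capacity 1024+: high-throughput scenarios; watch the memory cost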
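What "capacity" buys you can be shown without a runtime. The sketch below is an analogy, not Tokio code: it uses std's synchronous bounded channel (std::sync::mpsc::sync_channel) as a stand-in for tokio::sync::mpsc, and the function name accepted_before_full is made up for illustration. It counts how many messages a producer can push before the channel reports that it is full, which is the point where an async send().await would start waiting.

```rust
use std::sync::mpsc::{sync_channel, TrySendError};

// Returns how many messages a bounded channel accepts while nobody is receiving.
pub fn accepted_before_full(capacity: usize, attempts: u32) -> u32 {
    // Keep the receiver alive (but idle) so the channel stays connected.
    let (tx, _rx) = sync_channel::<u32>(capacity);
    let mut accepted = 0;
    for i in 0..attempts {
        match tx.try_send(i) {
            Ok(()) => accepted += 1,
            // The buffer is full: a blocking or async send would wait here.
            Err(TrySendError::Full(_)) => break,
            // The receiver is gone: nothing more can be delivered.
            Err(TrySendError::Disconnected(_)) => break,
        }
    }
    accepted
}
```

With a capacity of 16, the producer gets exactly 16 messages ahead of an idle consumer and no further, so the capacity is also the upper bound on buffered memory for that link.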

Pattern 3: combining Stream and Semaphore

More flexible control, useful when tasks of different priority share a limit:

use tokio::sync::Semaphore;
use futures::stream::{self, StreamExt};

async fn rate_limited_processing() {
    let semaphore = std::sync::Arc::new(Semaphore::new(5));
    let items: Vec<u32> = (0..20).collect();

    let results: Vec<u32> = stream::iter(items)
        .map(|item| {
            let sem = semaphore.clone();
            async move {
                // At most 5 Futures get past this point at the same time
                let _permit = sem.acquire().await.unwrap();
                tokio::time::sleep(std::time::Duration::from_millis(100)).await;
                item * 2
            }
        })
        .buffer_unordered(20) // up to 20 Futures in flight; most wait for a permit
        .collect()
        .await;

    println!("results: {results:?}");
}

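Here the Semaphore sets the real concurrency limit (5 permits), while buffer_unordered(20) sets how many Futures the stream keeps in flight, most of them waiting for a permit. The two layers are separate, each with its own job.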

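Stream versus Iterator

Essential differences

Dimension	Iterator	Stream
Sync/async	Synchronous	Asynchronous
Waiting	`next()` returns synchronously and may block the thread	`next().await` may suspend the task
Backpressure	Inherent (pull model)	Pull-based too, but channels and spawned tasks need explicit bounds
Combinators	Rich and stable	Rich, but they live in futures-util
Standard library	`std::iter`	Not in stable std
Zero overhead	Yes	Has scheduling overhead
Composition	Synchronous composition	Requires async composition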

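Why is Stream not in the standard library?

This is partly history and partly politics. Stream was once planned for the standard library (as std::stream), but it stalled because of:

Fragmentation of async runtimes (tokio vs async-std vs smol)
Designs for streams that lend borrowed items depended on GATs, which were unstable at the time
Disagreement in the community over whether the trait should expose async fn next() or poll_next

Today Stream is defined in futures-core, and most of the async ecosystem builds on it. The standard library carries a nightly-only counterpart, std::async_iter::AsyncIterator. In 2026 there are still proposals to stabilize it, but that is unlikely to land in the short term.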

Converting from Iterator to Stream

use futures::stream::{self, StreamExt};
use tokio::runtime::Runtime;

fn main() {
    // Iterator -> Stream (cheap: every item is immediately ready)
    let _s1 = stream::iter(0..10);

    // Stream -> Iterator has no direct conversion, because next() is async.
    // From synchronous code, block on a runtime and drain the Stream into a Vec:
    let rt = Runtime::new().unwrap();
    let items: Vec<i32> = rt.block_on(async {
        let mut s = stream::iter(0..10);
        let mut vec = vec![];
        while let Some(v) = s.next().await {
            vec.push(v);
        }
        vec
    });
    println!("{items:?}");
}
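If an actual Iterator is needed rather than a drained Vec, the usual approach is an adapter that blocks on each poll_next. The std-only sketch below shows the idea under stated assumptions: the Stream trait is redeclared locally to mirror futures-core, and BlockingIter, Counter, block_on, and times_ten are hypothetical names. The futures crate offers a similar adapter, futures::executor::block_on_stream.

```rust
use std::future::{poll_fn, Future};
use std::pin::{pin, Pin};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread::{self, Thread};

// Local mirror of futures-core's Stream trait, redeclared so the sketch has no dependencies.
pub trait Stream {
    type Item;
    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>>;
}

// Hypothetical stream: yields 0, 1, ..., limit - 1.
pub struct Counter {
    pub next: u32,
    pub limit: u32,
}

impl Stream for Counter {
    type Item = u32;
    fn poll_next(mut self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<Option<u32>> {
        if self.next < self.limit {
            self.next += 1;
            Poll::Ready(Some(self.next - 1))
        } else {
            Poll::Ready(None)
        }
    }
}

// Unparks the blocked thread when wake() is called.
struct ThreadWaker(Thread);

impl Wake for ThreadWaker {
    fn wake(self: Arc<Self>) {
        self.0.unpark();
    }
}

// Minimal single-future executor: poll, and park the thread on Pending.
pub fn block_on<F: Future>(fut: F) -> F::Output {
    let mut fut = pin!(fut);
    let waker = Waker::from(Arc::new(ThreadWaker(thread::current())));
    let mut cx = Context::from_waker(&waker);
    loop {
        match fut.as_mut().poll(&mut cx) {
            Poll::Ready(value) => return value,
            Poll::Pending => thread::park(),
        }
    }
}

// Adapter: each Iterator::next() call blocks the thread until the stream produces an item.
pub struct BlockingIter<S>(pub S);

impl<S: Stream + Unpin> Iterator for BlockingIter<S> {
    type Item = S::Item;
    fn next(&mut self) -> Option<S::Item> {
        block_on(poll_fn(|cx| Pin::new(&mut self.0).poll_next(cx)))
    }
}

// Usage: ordinary Iterator combinators now work on top of the stream.
pub fn times_ten(limit: u32) -> Vec<u32> {
    BlockingIter(Counter { next: 0, limit }).map(|x| x * 10).collect()
}
```

An adapter like this belongs in synchronous code only. Calling it from inside an async task blocks the worker thread and can stall the runtime.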

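Key takeaway: avoid switching back and forth between Iterator and Stream in async code. Pick one model and carry it through. If the data source is synchronous (a Vec, a HashMap), use Iterator; if it involves I/O, the network, or timers, use Stream.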

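Lessons from practice

1. Prefer async-stream over a handwritten poll_next

In our experience, handwritten poll_next implementations are rarely correct on the first attempt. The stream! macro from async-stream is enough in roughly 99% of cases, with negligible performance cost. Consider writing poll_next by hand only on extremely performance-sensitive paths (for example, event streams at millions of items per second).

2. Backpressure is not optional

In a microservice architecture, a Stream pipeline without backpressure is a memory time bomb. Always put a bound on every producer-consumer link: channel capacity, buffered(n), or a Semaphore. Pick at least one.

3. Confirm the Stream is finite before collect()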

// Dangerous: infinite Stream + collect = never completes
let all: Vec<_> = infinite_stream().collect().await;

// Safe: bound it with take
let first_100: Vec<_> = infinite_stream().take(100).collect().await;
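Why take makes collect safe can be seen from a sketch of the adapter itself: it counts the items it passes through and reports end of stream once the budget is spent, so the consumer finally sees Poll::Ready(None). The code below is a std-only illustration, not the futures-util implementation; the Stream trait is redeclared locally, and Naturals, Take, block_on, and first_n are hypothetical names.

```rust
use std::future::{poll_fn, Future};
use std::pin::{pin, Pin};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread::{self, Thread};

// Local mirror of futures-core's Stream trait, redeclared so the sketch has no dependencies.
pub trait Stream {
    type Item;
    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>>;
}

// Hypothetical infinite stream: 0, 1, 2, ... It never returns Ready(None).
pub struct Naturals(pub u64);

impl Stream for Naturals {
    type Item = u64;
    fn poll_next(mut self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<Option<u64>> {
        let value = self.0;
        self.0 += 1;
        Poll::Ready(Some(value))
    }
}

// Sketch of a take(n)-style adapter: ends the stream after `remaining` items.
pub struct Take<S> {
    pub inner: S,
    pub remaining: usize,
}

impl<S: Stream + Unpin> Stream for Take<S> {
    type Item = S::Item;
    fn poll_next(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<S::Item>> {
        if self.remaining == 0 {
            return Poll::Ready(None); // budget spent: signal the end
        }
        let this = &mut *self;
        let polled = Pin::new(&mut this.inner).poll_next(cx);
        if let Poll::Ready(Some(_)) = &polled {
            this.remaining -= 1;
        }
        polled
    }
}

// Unparks the blocked thread when wake() is called.
struct ThreadWaker(Thread);

impl Wake for ThreadWaker {
    fn wake(self: Arc<Self>) {
        self.0.unpark();
    }
}

// Minimal single-future executor: poll, and park the thread on Pending.
pub fn block_on<F: Future>(fut: F) -> F::Output {
    let mut fut = pin!(fut);
    let waker = Waker::from(Arc::new(ThreadWaker(thread::current())));
    let mut cx = Context::from_waker(&waker);
    loop {
        match fut.as_mut().poll(&mut cx) {
            Poll::Ready(value) => return value,
            Poll::Pending => thread::park(),
        }
    }
}

// Collects the first n naturals; without Take this loop would never finish.
pub fn first_n(n: usize) -> Vec<u64> {
    block_on(async {
        let mut s = Take { inner: Naturals(0), remaining: n };
        let mut out = Vec::new();
        while let Some(v) = poll_fn(|cx| Pin::new(&mut s).poll_next(cx)).await {
            out.push(v);
        }
        out
    })
}
```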

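4. Stream lifetimes are more complex than Future lifetimes

A Future is usually short-lived (it waits for one value), while a Stream may live for a long time (it keeps running). Inside an async block or a stream! body, borrows of local variables may live across .await points; that is exactly why the resulting state machine must be pinned. Two limits do apply: an item cannot borrow from the Stream's own state (the trait has no lending form), and a Stream moved into tokio::spawn must be Send + 'static, so it cannot borrow from the caller's stack.

5. Use pin_project for Streams that wrap pinned fields

If you write a Stream by hand and it wraps another Stream or Future that is not Unpin, use the pin-project-lite or pin-project crate to project Pin<&mut Self> onto that field safely. These crates do not make self-referential structs possible; they remove the unsafe code from pin projection: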

use pin_project_lite::pin_project;

pin_project! {
    // S is the wrapped stream type
    struct MyStream<S> {
        // Not pinned: reached as a plain &mut Vec<u32> after projection
        data: Vec<u32>,
        // Pinned: reached as Pin<&mut S> after projection
        #[pin]
        inner: S,
    }
}
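When the wrapped stream is known to be Unpin, projection is not needed at all, and this is worth knowing as the simpler alternative. The std-only sketch below uses that alternative technique (an Unpin bound plus Pin::new), not pin_project. The Stream trait is redeclared locally to mirror futures-core, and Recording, Counter, block_on, and record are hypothetical names.

```rust
use std::future::{poll_fn, Future};
use std::pin::{pin, Pin};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread::{self, Thread};

// Local mirror of futures-core's Stream trait, redeclared so the sketch has no dependencies.
pub trait Stream {
    type Item;
    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>>;
}

// Hypothetical inner stream: yields 0, 1, ..., limit - 1.
pub struct Counter {
    pub next: u32,
    pub limit: u32,
}

impl Stream for Counter {
    type Item = u32;
    fn poll_next(mut self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<Option<u32>> {
        if self.next < self.limit {
            self.next += 1;
            Poll::Ready(Some(self.next - 1))
        } else {
            Poll::Ready(None)
        }
    }
}

// Wrapper that forwards items and keeps a copy of each one it has seen.
pub struct Recording<S> {
    pub seen: Vec<u32>,
    pub inner: S,
}

impl<S: Stream<Item = u32> + Unpin> Stream for Recording<S> {
    type Item = u32;
    fn poll_next(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<u32>> {
        // Recording<S> is Unpin because both fields are, so the pin derefs to &mut Self.
        let this = &mut *self;
        // Re-pinning an Unpin field is safe and needs no projection macro.
        let polled = Pin::new(&mut this.inner).poll_next(cx);
        if let Poll::Ready(Some(v)) = polled {
            this.seen.push(v);
        }
        polled
    }
}

// Unparks the blocked thread when wake() is called.
struct ThreadWaker(Thread);

impl Wake for ThreadWaker {
    fn wake(self: Arc<Self>) {
        self.0.unpark();
    }
}

// Minimal single-future executor: poll, and park the thread on Pending.
pub fn block_on<F: Future>(fut: F) -> F::Output {
    let mut fut = pin!(fut);
    let waker = Waker::from(Arc::new(ThreadWaker(thread::current())));
    let mut cx = Context::from_waker(&waker);
    loop {
        match fut.as_mut().poll(&mut cx) {
            Poll::Ready(value) => return value,
            Poll::Pending => thread::park(),
        }
    }
}

// Drains the wrapper; returns the items received and the wrapper's own record.
pub fn record(limit: u32) -> (Vec<u32>, Vec<u32>) {
    block_on(async {
        let mut s = Recording { seen: Vec::new(), inner: Counter { next: 0, limit } };
        let mut items = Vec::new();
        while let Some(v) = poll_fn(|cx| Pin::new(&mut s).poll_next(cx)).await {
            items.push(v);
        }
        (items, s.seen)
    })
}
```

The trade-off is the Unpin bound: streams built with stream! or from async blocks are not Unpin, so callers would have to box and pin them first (Box::pin), whereas a pin_project-based wrapper accepts them directly.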
