Rust 2026 经验谈 - 异步 + FFI：桥接同步与异步世界

Rust 的异步运行时基于 epoll/kqueue/io_uring 等非阻塞 I/O，但现实世界中大量 C 库是阻塞的：数据库驱动、加密库、文件系统操作、硬件 SDK。如何在异步运行时中安全高效地调用这些阻塞代码，是 Rust 生产环境的核心挑战。本文系统梳理异步与 FFI 桥接的各个模式。

block_in_place vs spawn_blocking#

Tokio 提供了两种在异步上下文中执行阻塞代码的方式，它们的行为差异微妙但关键。

spawn_blocking：最安全的选择#

1
use tokio::task;
2

3
async fn call_blocking_c_lib(input: &[u8]) -> Vec<u8> {
4
    // 将阻塞调用提交到专门的阻塞线程池
5
    let input = input.to_owned();
6
    task::spawn_blocking(move || {
7
        // 这里的代码运行在阻塞线程池中
8
        // 不会占用异步工作线程
9
        expensive_c_computation(&input)
10
    })
11
    .await
12
    .expect("spawn_blocking panicked")
13
}
14

15
fn expensive_c_computation(data: &[u8]) -> Vec<u8> {
16
    // 模拟阻塞的 C 库调用
17
    std::thread::sleep(std::time::Duration::from_millis(100));
18
    data.to_vec()
19
}

spawn_blocking 的工作原理：

将闭包提交到一个独立的多线程线程池（与异步运行时分离）
异步工作线程立即释放，可以调度其他 task
闭包执行完毕后，结果通过 oneshot channel 传回异步世界

block_in_place：在当前线程上阻塞#

1
use tokio::task;
2

3
async fn call_blocking_in_place(input: &[u8]) -> Vec<u8> {
4
    let input = input.to_owned();
5
    // 在当前线程上阻塞，但告知运行时可以借用此线程
6
    task::block_in_place(move || {
7
        expensive_c_computation(&input)
8
    })
9
}

block_in_place 的工作原理：

在当前异步工作线程上执行闭包
通知运行时”这个线程暂时不可用”，运行时会增加一个工作线程来补偿
闭包返回后，当前线程恢复为异步工作线程

关键差异对比#

维度	spawn_blocking	block_in_place
执行线程	专门的阻塞线程池	当前异步工作线程
线程创建	无（复用阻塞池）	可能触发新工作线程
闭包约束	`'static + Send`	`'static + Send`
task 间影响	无	临时减少异步工作线程数
适用场景	长时间阻塞	短时间阻塞
current-thread 运行时	支持	不支持（panic）
多次调用	线程池复用，高效	可能反复创建线程
与 task::spawn 交互	安全	安全

选型建议：

默认用 spawn_blocking。它是更安全、更可预测的选择。
block_in_place 适合阻塞时间极短（< 1ms）的场景，避免 spawn_blocking 的闭包提交开销。
在 current_thread 运行时中，只能用 spawn_blocking，block_in_place 会 panic。
在多层嵌套中，block_in_place 可以在 spawn_blocking 闭包内使用，但反过来不行。

常见错误：在 async fn 中直接阻塞#

1
// 错误！会阻塞异步工作线程，影响同一线程上的其他 task
2
async fn bad_blocking() -> String {
3
    std::thread::sleep(std::time::Duration::from_secs(5)); // 整个线程卡住！
4
    String::from("done")
5
}
6

7
// 正确：用 spawn_blocking
8
async fn good_blocking() -> String {
9
    tokio::task::spawn_blocking(|| {
10
        std::thread::sleep(std::time::Duration::from_secs(5));
11
        String::from("done")
12
    })
13
    .await
14
    .unwrap()
15
}

直接在 async fn 中调用阻塞函数是新手最常犯的错误，后果是整个工作线程被卡住，同一线程上的其他 task 全部无法调度。

criterion 异步 Benchmark 实操#

对异步代码做 benchmark 比同步代码复杂，因为需要运行时。以下是 2026 年的推荐实践：

基本模式#

1
use criterion::{criterion_group, criterion_main, Criterion, BenchmarkId};
2
use tokio::runtime::Runtime;
3

4
fn bench_async_function(c: &mut Criterion) {
5
    let rt = Runtime::new().unwrap();
6

7
    c.bench_function("async_computation", |b| {
8
        b.to_async(&rt).iter(|| async {
9
            some_async_work().await
10
        })
11
    });
12
}
13

14
async fn some_async_work() -> u64 {
15
    tokio::time::sleep(std::time::Duration::from_micros(10)).await;
16
    42
17
}
18

19
criterion_group!(benches, bench_async_function);
20
criterion_main!(benches);

b.to_async(&rt) 是 criterion 提供的异步 benchmark 适配器，它在指定的运行时上执行 async 闭包。

对比 spawn_blocking 开销#

1
fn bench_spawn_vs_block_in_place(c: &mut Criterion) {
2
    let rt = Runtime::new().unwrap();
3

4
    let mut group = c.benchmark_group("blocking_methods");
5

6
    group.bench_function("spawn_blocking", |b| {
7
        b.to_async(&rt).iter(|| async {
8
            tokio::task::spawn_blocking(|| {
9
                std::thread::sleep(std::time::Duration::from_micros(100));
10
            })
11
            .await
12
            .unwrap()
13
        })
14
    });
15

16
    group.bench_function("block_in_place", |b| {
17
        b.to_async(&rt).iter(|| async {
18
            tokio::task::block_in_place(|| {
19
                std::thread::sleep(std::time::Duration::from_micros(100));
20
            })
21
        })
22
    });
23

24
    group.finish();
25
}

实测结果（在我的 i7-13700K 上）：

阻塞 100μs 的任务，spawn_blocking 均值约 115μs（含线程池调度）
block_in_place 均值约 102μs（直接在当前线程）
差异随阻塞时间缩小而变得显著：10μs 任务差异可达 30%

注意事项#

不要在 benchmark 中创建新运行时——在函数外创建一次，传引用进去
异步 benchmark 不测量运行时启动时间——那不是你的代码开销
用 black_box 防止优化消除——criterion::black_box(result)
对含 I/O 的 benchmark 要谨慎——结果不稳定，考虑 mock

与 C 库的异步交互模式#

模式一：回调转 Future#

很多 C 库采用回调模式（注册回调函数，异步通知结果）。将其转为 Rust Future 的标准模式：

1
use std::future::Future;
2
use std::pin::Pin;
3
use std::sync::{Arc, Mutex};
4
use std::task::{Context, Poll, Waker};
5

6
struct CCallbackFuture {
7
    result: Arc<Mutex<Option<i32>>>,
8
    waker: Arc<Mutex<Option<Waker>>>,
9
}
10

11
impl Future for CCallbackFuture {
12
    type Output = i32;
13

14
    fn poll(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Self::Output> {
15
        let mut result = self.result.lock().unwrap();
16
        if let Some(val) = result.take() {
17
            Poll::Ready(val)
18
        } else {
19
            // 保存 waker，C 回调触发时唤醒
20
            *self.waker.lock().unwrap() = Some(cx.waker().clone());
21
            Poll::Pending
22
        }
23
    }
24
}
25

26
// C 回调函数
27
unsafe extern "C" fn on_complete(value: i32, user_data: *mut std::ffi::c_void) {
28
    let data = unsafe { &*(user_data as *const CallbackData) };
29
    *data.result.lock().unwrap() = Some(value);
30
    if let Some(waker) = data.waker.lock().unwrap().take() {
31
        waker.wake();
32
    }
33
}
34

35
struct CallbackData {
36
    result: Arc<Mutex<Option<i32>>>,
37
    waker: Arc<Mutex<Option<Waker>>>,
38
}

关键要点：

用 Arc<Mutex<>> 在 C 回调和 Rust Future 间共享状态
回调中必须唤醒 waker，否则 Future 永远 Pending
user_data 指针的生命周期管理是最大的安全风险——确保 Future 活着时 user_data 有效

模式二：轮询模式（Polling C API）#

某些 C 库提供非阻塞 poll 接口：

1
use tokio::io::Interest;
2
use tokio::net::UdpSocket;
3

4
async fn poll_c_fd() -> std::io::Result<()> {
5
    // 将 C 库的 fd 注册到 tokio 的 epoll
6
    // 注意：from_raw_fd 需要 unsafe，且要求 fd 有效且未被 tokio 接管
7
    let socket = unsafe { UdpSocket::from_raw_fd(c_library_fd) };
8

9
    loop {
10
        socket.ready(Interest::READABLE).await?;
11
        // fd 可读，调用 C 库的非阻塞读取
12
        let result = unsafe { c_lib_nonblocking_read() };
13
        if result > 0 {
14
            // 处理数据
15
        }
16
    }
17
}

模式三：io_uring 集成思路#

io_uring 是 Linux 5.1+ 的异步 I/O 接口，与 Rust 异步运行时有天然的匹配性：

1
// 使用 io_uring 的概念模型（通过 tokio-uring 或 glommio）
2
async fn uring_read_example() -> std::io::Result<Vec<u8>> {
3
    // tokio-uring 的写法
4
    use tokio_uring::fs::File;
5

6
    let file = File::open("huge_file.dat").await?;
7
    let buf = vec![0u8; 4096];
8
    let (res, buf) = file.read_at(buf, 0).await?;
9
    Ok(buf[..res].to_vec())
10
}

io_uring 集成的挑战：

与 tokio 的兼容性：tokio 默认用 epoll，io_uring 需要独立的运行时（tokio-uring）或替代运行时（glommio）
缓冲区所有权：io_uring 要求缓冲区在操作完成前保持有效，这与 Rust 的借用模型有摩擦
内核版本要求：需要 Linux 5.1+（某些特性需 5.6+），跨平台部署受限
调优参数：uring 的 entry 数量、fixed buffer、SQE 批量提交等需要针对场景调优

在异步运行时中调用阻塞系统库的经验#

经验一：识别你的阻塞边界#

不是所有 FFI 都是阻塞的。分类判断：

类型	例子	处理方式
CPU 密集	加密、压缩、图像处理	`spawn_blocking`
阻塞 I/O	传统文件 I/O、数据库驱动	`spawn_blocking`
非阻塞 I/O	epoll-based C 库	fd 注册到运行时
回调式	libcurl、libuv	回调转 Future
长期阻塞	硬件等待、串口读	独立线程 + channel

经验二：控制阻塞线程池大小#

Tokio 的阻塞线程池默认最大 512 个线程。对于 CPU 密集型任务，你应该限制到 CPU 核心数：

1
use tokio::runtime::Builder;
2

3
let rt = Builder::new_multi_thread()
4
    .worker_threads(4)        // 异步工作线程
5
    .max_blocking_threads(8)  // 阻塞线程池上限
6
    .build()
7
    .unwrap();

过大的阻塞池会导致线程竞争和调度开销；过小会导致任务排队。经验公式：CPU 密集型设为核心数，I/O 密集型可设为核心数的 2~4 倍。

经验三：避免在 spawn_blocking 中持有异步锁#

1
// 危险：在阻塞代码中持有异步 Mutex 的锁
2
async fn bad_pattern() {
3
    let data = Arc::new(tokio::sync::Mutex::new(vec![]));
4
    let data_clone = data.clone();
5

6
    tokio::task::spawn_blocking(move || {
7
        let mut guard = data_clone.blocking_lock(); // 这可以工作...
8
        guard.push(1);
9
        // 但如果另一个 task 在等锁，可能死锁
10
    })
11
    .await
12
    .unwrap();
13
}
14

15
// 更安全：在异步侧获取锁，将数据传入阻塞闭包
16
async fn good_pattern() {
17
    let data = Arc::new(tokio::sync::Mutex::new(vec![]));
18

19
    {
20
        let mut guard = data.lock().await;
21
        let inner = &mut *guard;
22
        tokio::task::spawn_blocking(move || {
23
            // 对 inner 的数据做阻塞处理
24
        })
25
        .await
26
        .unwrap();
27
    }
28
}

经验四：为 C 库的线程安全性建档#

调用 C 库前，确认其线程安全模型：

线程安全：可以自由在 spawn_blocking 中并发调用
线程局部状态：每个 spawn_blocking 闭包是独立线程，线程局部状态不共享——需要显式传递
全局锁：C 库内部可能有全局 mutex，并发 spawn_blocking 调用实际串行执行，性能不如预期

经验五：cbindgen 与建桥的自动化#

当 Rust 异步代码需要暴露给 C 调用时：

1
// Rust 侧
2
#[unsafe(no_mangle)]
3
pub unsafe extern "C" fn rust_async_operation(
4
    input: *const u8,
5
    input_len: usize,
6
    callback: unsafe extern "C" fn(*mut u8, usize),
7
) {
8
    let input_data = unsafe { std::slice::from_raw_parts(input, input_len) }.to_vec();
9

10
    // 在独立运行时中启动异步操作
11
    std::thread::spawn(move || {
12
        let rt = tokio::runtime::Runtime::new().unwrap();
13
        rt.block_on(async {
14
            let result = async_process(&input_data).await;
15
            callback(result.as_ptr() as *mut u8, result.len());
16
        });
17
    });
18
}

注意这里创建了独立运行时——因为 C 侧的调用不受 Rust 运行时管理，必须自行管理生命周期。

音乐

音乐