# rust-async-internals
| name | rust-async-internals |
| description | Rust async internals for select/join, blocking, JNI bridges, Tokio setup, cancellation, and io_loop. |
Guide agents through async patterns specific to RIPDPI: JNI-to-async bridging, the io_loop event-driven architecture, tokio runtime configuration for Android NDK, CancellationToken-based shutdown, and common select!/join! pitfalls.
RIPDPI has two independent tokio runtimes bridged from JNI:

- Tunnel runtime (ripdpi-tunnel-android): shared multi_thread(2) runtime stored in a OnceCell<Arc<Runtime>>. A dedicated std::thread calls runtime.block_on(run_tunnel(...)) so the JNI thread is not blocked.
- Proxy runtime (ripdpi-android): run_proxy_with_embedded_control blocks the calling JNI thread directly (the Android Service thread). Uses an IdleGuard drop-safety pattern to reset state on panic.
```rust
// ripdpi-tunnel-android/src/session/lifecycle.rs -- the canonical pattern
//
// JNI create_session() returns a jlong handle.
// JNI start_session() spawns a std::thread that calls runtime.block_on():
let worker = std::thread::Builder::new()
    .name("ripdpi-tunnel-worker".into())
    .spawn(move || {
        let result = std::panic::catch_unwind(AssertUnwindSafe(|| {
            runtime.block_on(run_tunnel(config, fd, cancel, stats))
        }));
        // Handle Ok/Err/panic, update last_error and telemetry
    });
// The JNI thread returns immediately; stop_session() cancels via token.
```
Key rules for this pattern:

- Never call block_on from a JNI callback thread directly for long-running work -- spawn a dedicated thread so the JNI call returns promptly.
- Dup the fd (nix::unistd::dup) before passing it to async code -- Android VpnService can revoke the original fd at any time.
- Wrap block_on in catch_unwind -- a panic must not unwind through JNI.

Do not downgrade tokio below these versions:

- The floor that fixes a broadcast::Sender::clone() soundness bug (missing synchronization for Send + !Sync payloads) and a CancellationToken race where futures that polled to Ready before the token fired were not cancelled. RIPDPI's connection-level abort paths rely on the cancellation fix. See the tokio CHANGELOG and PR #7462.
- The floor that fixes an fd leak when an io_uring open operation is cancelled before completion. ripdpi-tunnel-core cancels in-flight FS/IO operations on session teardown; below this version the leaked fds accumulate until the process exits. See PR #7983.

If cargo tree -i tokio shows a version below the floor, promote it in native/rust/Cargo.toml workspace dependencies (never downgrade a transitive dep to work around a breaking change -- open an upstream issue instead).
```rust
// ripdpi-tunnel-android/src/session/registry.rs
tokio::runtime::Builder::new_multi_thread()
    .worker_threads(2)              // constrained for mobile battery/CPU
    .thread_stack_size(1024 * 1024) // 1 MiB -- Android default is small
    .thread_name("ripdpi-tunnel-tokio")
    .enable_all()
    .build()
```
Android-specific considerations:

- worker_threads(2) balances throughput vs battery drain on mobile.
- thread_stack_size(1 MiB) -- the Android NDK default stack is often too small for deep async state machines.
- The runtime is stored in a OnceCell<Arc<Runtime>> and shared across sessions to avoid repeated thread pool creation.
- A current_thread runtime is used only in tests and the DNS resolver's synchronous resolve_blocking() path.

The core tunnel runs a single-task 6-phase loop (io_loop_task in ripdpi-tunnel-core/src/io_loop.rs). Its phases include reading the TUN device via AsyncFd::try_io, packet classification, dispatch to spawned TcpSession tasks, and a select! over TUN readable / smoltcp timer / UDP / DNS / cancel.

Phase 6 select pattern:
```rust
tokio::select! {
    _ = tun.readable() => {},
    _ = tokio::time::sleep(smol_delay) => {},
    udp_event = udp_rx.recv() => { handle_udp_event(...) }
    dns_result = async { ... }, if dns_resp_rx.is_some() => { ... }
    _ = cancel.cancelled() => { break; }
}
```
This is NOT a typical "spawn per connection" design. One task owns the entire
smoltcp stack. Individual TCP/UDP sessions ARE spawned as separate tokio tasks
that communicate back via mpsc channels and tokio::io::duplex pairs.
The smoltcp ↔ duplex bridge uses a NoopWaker-based manual poll pattern: see rust-io-loop skill for the full treatment of try_read_duplex / try_write_duplex in io_loop/bridge.rs:19-45. The short version: the bridge calls poll_read / poll_write directly from the io_loop tick with a discarded waker. Consequence: the try_*_duplex family must NEVER be called from inside an async await — a Poll::Pending under the NoopWaker stalls the task permanently. If you find yourself writing async fn wrappers around duplex streams in this crate, stop and consult the io_loop skill.
```rust
// WRONG: blocks a runtime thread, starves other tasks
async fn bad() {
    std::thread::sleep(Duration::from_secs(1));
    std::fs::read_to_string("file.txt").unwrap();
}

// CORRECT: async equivalents
async fn good() {
    tokio::time::sleep(Duration::from_secs(1)).await;
    tokio::fs::read_to_string("file.txt").await.unwrap();
}

// CORRECT: offload unavoidable blocking work
async fn with_blocking() {
    tokio::task::spawn_blocking(|| heavy_cpu_work()).await.unwrap();
}
```
In RIPDPI specifically: the WS tunnel (ripdpi-ws-tunnel) uses synchronous
std::net::TcpStream + tungstenite deliberately -- it runs on a dedicated
thread, not inside the tokio runtime. This is intentional, not a bug.
```rust
// select! completes when the FIRST branch finishes; LOSING branches are DROPPED.
// Their futures are cancelled -- any in-progress work is lost.
tokio::select! {
    result = fetch_a() => { /* fetch_b() dropped */ }
    result = fetch_b() => { /* fetch_a() dropped */ }
}

// Biased select: always checks branches in order. Use for priority shutdown.
loop {
    tokio::select! {
        biased;
        _ = cancel.cancelled() => break, // always checked first
        msg = rx.recv() => process(msg),
    }
}

// join! waits for ALL to complete -- no cancellation surprise.
let (a, b) = tokio::join!(fetch_a(), fetch_b());
```
```rust
// Parent creates the token, passes child tokens to spawned work.
let cancel = CancellationToken::new();
let child_cancel = cancel.child_token();
tokio::spawn(async move {
    tokio::select! {
        _ = child_cancel.cancelled() => { /* clean shutdown */ }
        _ = do_work() => {}
    }
});
// Later: cancel.cancel() propagates to all child tokens.
```
RIPDPI uses tokio_util::sync::CancellationToken (not tokio::sync::Notify)
for structured shutdown. The tunnel session state machine transitions through
Ready -> Starting -> Running -> Destroyed, with the token stored in the
Starting and Running variants.
Shutdown checklist:

- Verify cancel.cancelled() is in every select! loop. Missing it causes tasks that never terminate.
- Verify OwnedFd cleanup on all error paths in JNI lifecycle functions. The dup'd fd must be closed even if start_session fails.

Note: tokio-console is not practical for this project -- it requires the tokio_unstable cfg flag and console-subscriber, which add overhead and complexity to Android NDK cross-compilation. Use tracing spans and RUST_LOG filtering instead.
## Fn callbacks and higher-ranked trait bounds
Severity: WARNING
Higher-Ranked Trait Bounds (HRTBs) — for<'a> FnMut(&'a T) -> K — are the correct shape for callbacks that take a reference and return something that must not outlive the reference. But several sharp edges exist:
Unexpressible with dependent output: If K depends on 'a (e.g., K = &'a str), this cannot be expressed without GATs today:
```rust
// Not expressible: we want K to be &'a str, but 'a is bound inside the
// for<'a> quantifier while K is declared outside it.
fn register<K, F: for<'a> FnMut(&'a str) -> K>(_f: F) {}

// register(|s: &str| s); // error: the returned reference would need to name 'a
```
Workaround: use Box<dyn for<'a> Fn(&'a str) -> &'a str + 'static> or restructure to pass owned values.
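A sketch of the boxed-trait-object workaround: the owner stores the callback behind a fat pointer whose type spells out the HRTB. The Registry/first_word names are illustrative:

```rust
// The box's type carries the for<'a> bound, so the stored callback can
// return a reference tied to its argument.
struct Registry {
    cb: Box<dyn for<'a> Fn(&'a str) -> &'a str + Send + 'static>,
}

impl Registry {
    fn new(cb: Box<dyn for<'a> Fn(&'a str) -> &'a str + Send + 'static>) -> Self {
        Registry { cb }
    }
    fn call<'a>(&self, s: &'a str) -> &'a str {
        (self.cb)(s)
    }
}

// A named function always satisfies the HRTB, sidestepping closure inference.
fn first_word(s: &str) -> &str {
    s.split_whitespace().next().unwrap_or("")
}
```

Passing the named function (or a closure routed through a force_hrtb-style helper) avoids the fixed-lifetime inference problem described below.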
Closure inference quirk: Closures in stable Rust default to fixed-lifetime inference, not HRTB inference. A closure |s: &str| s fails to implement for<'a> Fn(&'a str) -> &'a str in some compiler versions. Workaround: name the function explicitly or use a helper to force HRTB:
```rust
fn force_hrtb<F: for<'a> Fn(&'a str) -> &'a str>(f: F) -> F { f }
let cb = force_hrtb(|s| s);
```
Async-Fn with +Send + 'static: Pre-RPITIT (before Rust 1.75), async functions in traits cannot be expressed as F: for<'a> AsyncFn(&'a T). The canonical pre-1.75 workaround is F: Fn(&T) -> Pin<Box<dyn Future<Output = R> + Send + '_>>. Post-1.75 with async fn in traits, prefer trait MyTrait { async fn call(&self, t: &T) -> R; }.
In RIPDPI: the JNI-to-async bridge passes callbacks across thread boundaries; all closures captured in tokio::spawn must be 'static + Send. Avoid capturing &T references — convert to owned or Arc<T> before spawn.
Reference: crabbook/borrowing_in_generic_functions.md
## &mut State across concurrent futures
Severity: WARNING
A captured &mut State inside an async block lives for the entire Future's lifetime, from first poll to completion. Two concurrent Futures cannot share &mut State:
```rust
// DOES NOT COMPILE: two mutable borrows of `state` active at once
let f1 = async { state.handle_packet(pkt1) };
let f2 = async { state.handle_packet(pkt2) };
tokio::join!(f1, f2); // error[E0499]
```
Correct approaches (in order of preference for RIPDPI):

1. A single task owns State; all other tasks communicate via mpsc channels. No sharing needed. This is the RIPDPI canonical pattern — preserve it.
2. Arc<Mutex<State>>: correct but serializes access. Acceptable for low-contention config state; unacceptable on the packet path.
3. RefCell<State> inside a !Send single-threaded runtime: valid for current-thread runtimes. Not applicable to RIPDPI's multi-thread setup.

Nightly/future options (document only, do not use in production today):

- Context::ext (unstable): pass state through Waker context without unsafe.
- resume(arg): coroutine-style state handoff. Available via the generator-light crate on stable.

Reference: crabbook/event_loops_and_shared_state.md
## Pin necessity in FFI
Severity: WARNING for self-referential or FFI types
Pin<&mut T> is a guarantee that T will not be moved after being pinned. This is required for:

- Self-referential types — the reason the compiler generates Pin in async state machines.
- FFI types whose address must remain stable after initialization.

In RIPDPI: cxx-generated bindings expose C++ types as Pin<&mut CppType>. This is correct — C++ may have a destructor that captures this, so the object address must remain stable. The same logic applies to any FFI handle allocated by C and returned by pointer: if the C API says "do not move this after init", wrap it in Pin<Box<T>> on the Rust side.
Rules:

- Combine Pin with PhantomPinned to opt out of Unpin for types that must never move after pinning.
- Box::pin(val) is the easiest way to heap-pin a value in Rust.
- Use Pin::get_unchecked_mut only with a // SAFETY: we never move T after this point comment.

Reference: crabbook/pin.md
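These rules can be sketched with a hypothetical address-stable FFI handle (NativeHandle and its fields are illustrative, not a real binding):

```rust
use std::marker::PhantomPinned;
use std::pin::Pin;

// PhantomPinned removes the automatic Unpin impl; Pin<Box<..>> then fixes
// the heap address that a C library would (hypothetically) record at init.
struct NativeHandle {
    state: u64,          // stand-in for C-side state tied to this address
    _pin: PhantomPinned, // opts out of Unpin
}

fn init_handle() -> Pin<Box<NativeHandle>> {
    // After this point the pointee's address must never change.
    Box::pin(NativeHandle { state: 0, _pin: PhantomPinned })
}

fn handle_addr(h: &Pin<Box<NativeHandle>>) -> *const NativeHandle {
    // The stable address we could hand across the FFI boundary.
    &**h as *const NativeHandle
}
```

Moving the Pin<Box<..>> owner only moves the box pointer; the pinned allocation (and thus the address C holds) stays put.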
## impl Trait (RPIT) overcaptures lifetimes in edition 2024
Severity: WARNING — edition migration hazard
In Rust 2021 and earlier, return-position impl Trait (RPIT) did NOT implicitly capture lifetime parameters unless explicitly listed. In Rust 2024, all in-scope lifetimes are captured automatically. Consequence: functions that were 'static-compatible in edition 2021 may become non-'static after migration because the return type now captures a lifetime from an input reference.
Concrete symptom: a function returning impl Future + 'static that takes &self now infers impl Future + '_ — breaking any tokio::spawn(obj.method()) call site.
Fix: use precise use<..> syntax (stabilized in Rust 1.82) to opt out of capturing specific lifetimes:
```rust
// Edition 2024: use<> opts out of capturing any lifetime, so nothing
// borrowed from `data` may end up inside the returned future.
fn process(data: &str) -> impl Future<Output = u32> + use<> {
    let len = data.len() as u32; // read the borrow BEFORE building the future
    async move { len }
}
```
The impl_trait_overcaptures lint (part of rust-2024-compatibility group) flags affected sites before migration. Run cargo fix --edition and inspect every RPIT diff carefully.
Reference: Rust Blog: impl Trait capture rules, Edition Guide RPIT section.
## tokio::time::timeout is cooperative — never fires on non-yielding futures
Severity: CRITICAL
tokio::time::timeout wraps a future and checks the deadline before each poll. If the wrapped future never reaches an .await point — tight CPU loop, blocking syscall, heavy synchronous computation — the timeout never fires. The future runs to completion regardless of the deadline.
```rust
// DANGEROUS: looks protected but is not
let result = tokio::time::timeout(
    Duration::from_secs(1),
    async {
        // No .await -- timeout will never fire
        expensive_cpu_computation()
    }
).await;
```
Fix: any blocking or CPU-heavy work must be moved to spawn_blocking before wrapping with timeout:

```rust
let result = tokio::time::timeout(
    Duration::from_secs(1),
    tokio::task::spawn_blocking(|| expensive_cpu_computation())
).await;
```

Note that the timeout only abandons the wait: after Err(Elapsed) is returned, the blocking closure still runs to completion on its pool thread.
In RIPDPI: strategy-probe candidates, DPI classification heuristics, and TLS fingerprinting are CPU-heavy paths. Never wrap them in timeout without also moving them to spawn_blocking.
## JoinSet drop cannot abort spawn_blocking threads — silent shutdown hang
Severity: WARNING
When a JoinSet is dropped, it calls .abort() on all tracked futures. However, tasks spawned via spawn_blocking run on OS threads and Tokio documents that they cannot be cancelled by abort. If a JoinSet contains handles to async tasks that internally delegate to spawn_blocking (common in database pool workers, file I/O wrappers, and heavy computation), dropping the JoinSet during shutdown does not stop the underlying threads.
In practice: the process appears to shut down (the async tasks receive abort) but OS threads continue running until completion, potentially blocking process exit or causing Runtime::shutdown_timeout to fire.
Fix: for tasks that use spawn_blocking internally, prefer explicit cancellation signalling (a CancellationToken passed into the blocking closure) rather than relying on JoinSet abort. Always test shutdown behavior under load, not just happy-path sequential tests.
## spawn_blocking pool exhaustion from long-lived tasks
Severity: WARNING
Tokio's blocking thread pool has a default cap of 512 threads. Each spawn_blocking call occupies one thread until the closure completes. Long-running or indefinitely-polling tasks — file watchers, polling loops, persistent connections — exhaust the pool. When the pool is saturated, new spawn_blocking calls queue, causing latency spikes that appear as async slowdowns with no obvious cause.
Decision rule:

- spawn_blocking: bounded CPU work (target < 100 ms), occasional blocking syscalls, short file I/O.
- std::thread::spawn: indefinite blocking work, event loops, long-lived watchers.

In RIPDPI: the WS tunnel relay (ripdpi-ws-tunnel) already uses std::thread::spawn correctly. Any new persistent blocking work MUST follow this pattern, not spawn_blocking.
## block_in_place panics on current_thread runtime
Severity: WARNING
tokio::task::block_in_place migrates the current worker thread to the blocking pool and redistributes other tasks to remaining workers. Two hazards:
Panics on current_thread runtime: #[tokio::test] uses current_thread by default. Calling block_in_place inside a test panics with "can call blocking only when running on the multi-threaded runtime". This causes confusing test failures when production code uses block_in_place.
Starves join! branches: inside a join!, other branches run on the same task. block_in_place suspends them for the duration of the blocking call, causing unexpected sequencing (branch A completes; branch B runs only after).
Fix: use spawn_blocking instead of block_in_place in both cases — it is safe on all runtime flavors and does not affect co-located tasks.
## broadcast receiver silently drops messages on Lagged
Severity: WARNING
tokio::sync::broadcast channels have a fixed ring-buffer capacity. A slow receiver that falls behind will have old messages overwritten. The next recv() call returns Err(RecvError::Lagged(n)) — but most code handles only the Ok(msg) arm and treats Lagged as a transient error, silently dropping n events.
```rust
// BUG: the while-let EXITS on the first Lagged error, silently abandoning
// the subscription along with the dropped messages
while let Ok(msg) = rx.recv().await {
    process(msg);
}

// CORRECT: handle Lagged explicitly
loop {
    match rx.recv().await {
        Ok(msg) => process(msg),
        Err(RecvError::Lagged(n)) => {
            tracing::warn!("broadcast: dropped {} messages", n);
            // decide: continue, alert, or reconnect
        }
        Err(RecvError::Closed) => break,
    }
}
```
For audit logs, metrics, or state-machine transition messages, Lagged drops are data loss. Use mpsc with explicit backpressure for lossless delivery.
## std::sync::Mutex guard across .await deadlocks silently
Severity: CRITICAL
std::sync::Mutex guards do not implement Send. The compiler rejects them in tokio::spawn futures (which require Send). However, in current_thread executors or non-Send futures, the compiler accepts the guard crossing an .await point. At runtime, if the executor schedules another task that acquires the same lock, it deadlocks: the async task is suspended holding the lock, and the other task blocks.
This pattern works in development (sequential load, single task) and deadlocks only under concurrent production load.
```rust
// DEADLOCK risk: guard lives across .await
let guard = mutex.lock().unwrap();
some_async_op().await; // another task may try to lock here
drop(guard);

// CORRECT: drop before .await
let value = {
    let guard = mutex.lock().unwrap();
    guard.value.clone()
};
some_async_op().await;
```
Rule: if a Mutex guard must genuinely live across .await, use tokio::sync::Mutex. If it does not need to, drop the guard explicitly before any .await.
## async fn in traits is not object-safe and has no Send bound
Severity: WARNING
async fn in traits was stabilized in Rust 1.75 (RPITIT). Three non-obvious hazards when replacing #[async_trait]:
Not dyn-safe: a trait containing async fn cannot be used as dyn Trait. Code that previously used Box<dyn MyTrait> (via #[async_trait] which boxes futures internally) breaks at compile time.
No automatic Send bound: native async fn in traits does not add Send to the returned future. tokio::spawn(obj.method()) fails because the future is not Send.
Fix for both: use #[trait_variant::make(MyTraitSend: Send)] from the trait-variant crate (part of the async-fn-in-trait stabilization roadmap) to generate a Send-compatible trait variant:
```rust
#[trait_variant::make(MyTraitSend: Send)]
pub trait MyTrait {
    async fn process(&self, input: &str) -> Result<String>;
}
// Now use `MyTraitSend` for tokio::spawn contexts
```
In RIPDPI: any trait with async fn methods used in tokio::spawn contexts MUST use trait_variant or keep #[async_trait]. Do not mass-replace #[async_trait] without auditing every Box<dyn> and tokio::spawn use site.
Related skills:

- .claude/skills/rust-debugging/ -- GDB/LLDB debugging of async Rust
- .claude/skills/rust-profiling/ -- cargo-flamegraph with async stack frames
- .claude/skills/memory-model/ -- memory ordering in async contexts