Researchers build an encrypted routing layer for private AI inference

Secure multi-party computation (MPC) lets organizations run AI models without exposing raw inputs by splitting data into encrypted fragments and distributing computation across non-colluding servers. SecureRouter adds input-adaptive encrypted routing to select a suitably sized model from a pool, cutting average encrypted inference time substantially compared to a fixed large-model approach while keeping accuracy close to the large-model baseline. #SecureRouter #SecFormer

Keypoints

Secure multi-party computation (MPC) enables private AI inference by splitting data into encrypted fragments processed by non-colluding servers.
Prior private-inference methods run the same large model for every query, making encrypted inference slow and expensive.
SecureRouter performs encrypted, input-adaptive routing to choose an appropriate model from a pool without exposing the routing decision.
SecureRouter reduced average encrypted inference time by 1.95× versus SecFormer and achieved up to 2.19× speedups on simpler tasks with minimal accuracy loss on most benchmarks.
The routing layer adds modest overhead (≈39 MB memory, ~4 seconds latency, ~1.86 GB network) and integrates with existing MPC frameworks and standard model architectures.

SHARE THIS STORY

WhatsApp X (Twitter)Telegram Bluesky Facebook LinkedIn Threads Email Print