Researchers build an encrypted routing layer for private AI inference

Researchers build an encrypted routing layer for private AI inference

Secure multi-party computation (MPC) lets organizations run AI models without exposing raw inputs by splitting data into encrypted fragments and distributing computation across non-colluding servers. SecureRouter adds input-adaptive encrypted routing to select a suitably sized model from a pool, cutting average encrypted inference time substantially compared to a fixed large-model approach while keeping accuracy close to the large-model baseline. #SecureRouter #SecFormer

Keypoints

  • Secure multi-party computation (MPC) enables private AI inference by splitting data into encrypted fragments processed by non-colluding servers.
  • Prior private-inference methods run the same large model for every query, making encrypted inference slow and expensive.
  • SecureRouter performs encrypted, input-adaptive routing to choose an appropriate model from a pool without exposing the routing decision.
  • SecureRouter reduced average encrypted inference time by 1.95× versus SecFormer and achieved up to 2.19× speedups on simpler tasks with minimal accuracy loss on most benchmarks.
  • The routing layer adds modest overhead (≈39 MB memory, ~4 seconds latency, ~1.86 GB network) and integrates with existing MPC frameworks and standard model architectures.

Read More: https://www.helpnetsecurity.com/2026/04/21/securerouter-encrypted-ai-inference/