Throughput planning
Use bounded concurrency and staged batch windows instead of unbounded bursts.
Handling 429 responses
- Pause request stream.
- Wait using backoff + jitter.
- Resume gradually.
Implementation notes
- Keep per-tenant queues where relevant.
- Monitor sustained throttle rate, not single spikes.
- Separate retry workers from primary ingestion workers.