Pregunta de entrevista de Zendesk

How would you design an Realtime LLM Inference Service