NVIDIA interview question

Questions about quantization, inference optimization, and LLM system design.