SAS interview question

Let's say you are implementing a distributed sort algorithm running on hundreds of nodes. How does that algorithm work? Where are the bottlenecks?

Interview answer

Anonymous

Dec 6, 2012

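Each node sorts its own partition locally, then sends its sorted run up to a parent node, which merges the runs it receives (the "reduce" step) and passes the result up another layer, and so on until a single node holds the fully sorted output. The bottleneck is most likely I/O, depending on the network fabric: all of the data has to be distributed to the nodes first, and then all of it has to move up to a single node at the end.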
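A rough single-process sketch of that local-sort-then-merge-up scheme, in Python. This is only an illustration under assumptions: the "nodes" are plain in-memory lists rather than real machines, and the names `local_sort`, `merge_layer`, `tree_sort` and the `fan_in` parameter are made up for the example. The only library call is the standard `heapq.merge`.

```python
import heapq


def local_sort(chunk):
    """Stand-in for one node sorting its own partition."""
    return sorted(chunk)


def merge_layer(runs, fan_in=2):
    """One "reduce" layer: each parent merges up to `fan_in` sorted runs."""
    merged = []
    for i in range(0, len(runs), fan_in):
        group = runs[i:i + fan_in]
        # heapq.merge does a k-way merge of already-sorted inputs.
        merged.append(list(heapq.merge(*group)))
    return merged


def tree_sort(chunks, fan_in=2):
    """Sort each chunk locally, then merge layer by layer up to one root.

    Returns the sorted data and the number of merge layers used.
    """
    runs = [local_sort(c) for c in chunks]
    layers = 0
    while len(runs) > 1:
        runs = merge_layer(runs, fan_in)
        layers += 1
    return (runs[0] if runs else []), layers


# Example: 8 simulated nodes, each holding a small unsorted partition.
chunks = [[(i * 37 + j * 11) % 101 for j in range(5)] for i in range(8)]
result, layers = tree_sort(chunks, fan_in=2)
```

With 8 partitions and a fan-in of 2 this takes 3 merge layers, and the last layer has a single node handling every record, which is where the I/O concern in the answer comes from. A larger `fan_in` means fewer layers but more incoming data per parent.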
