LLaVA-o1 breaks down the answer into multiple reasoning components and uses inference-time scaling to optimize each stage.