s3: You Don’t Need That Much Data to Train a Search Agent via RL. Outperforming 170,000 samples with just 2,400: the 70x efficiency…