AstraFlow is a dataflow-oriented reinforcement learning system designed for better flexibility and scalability. AstraFlow natively supports the following for LLM RL training without any ...