Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions
The author introduces ReimaginedAct, a method for text-to-pose video editing that predicts human action changes from text prompts, questions, or counterfactual queries. By combining video understanding, reasoning, and editing modules, the approach achieves effective action editing and imaginary scenarios.