Core Concepts
Hulk is a versatile model that unifies diverse human-centric tasks without task-specific finetuning.
Summary
The content introduces Hulk, a multimodal human-centric perceiver capable of handling various tasks without task-specific adaptation. It discusses the challenges in developing a generalist model and outlines the architecture of Hulk, including tokenizers, transformers, and objective functions. The training datasets and evaluation metrics are also detailed.
JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021
- Introduction to Hulk as a universal knowledge translator for human-centric tasks.
- Challenges in developing a generalist human-centric perceiver.
- Architecture of Hulk including tokenizers and transformers.
- Training datasets and evaluation metrics for assessing performance.
Statistics
Hulk achieved state-of-the-art performance on 11 of 12 benchmarks.
For pedestrian detection on the CrowdHuman dataset, the mAP is 77.5.
For 2D pose estimation on the COCO dataset, the AP is 85.3.
Quotes
"Human-centric perception tasks have wide industrial applications."
"Hulk pushes the limits on various human-centric tasks."