The paper proposes Geo-Llama, a framework for generating realistic synthetic human mobility trajectories with spatiotemporal constraints. The key highlights are:
Textual Encoding: Trajectories are represented as sequences of textual tokens, where each visit is encoded as "arrival time is t, location is l, duration is d".
Temporal-Order Permutation: To enable the LLM to capture spatiotemporal patterns regardless of the order of visits, the textual sequences are randomly permuted before fine-tuning.
Parameter-Efficient Fine-Tuning: The pre-trained LLM is fine-tuned using a parameter-efficient technique (LoRA) on the permuted textual data.
Controlled Generation: During generation, the fine-tuned LLM can generate trajectories based on random prompts (uncontrolled) or prompts representing spatiotemporal constraints (controlled).
Integrity Check: A post-processing step is performed to ensure the generated trajectories satisfy integrity constraints, such as no overlapping visits.
The experiments on real-world and synthetic datasets demonstrate that Geo-Llama outperforms existing methods in generating realistic trajectories, both in uncontrolled and controlled settings. It also exhibits superior data efficiency, maintaining high performance with limited training data.
Egy másik nyelvre
a forrásanyagból
arxiv.org
Mélyebb kérdések