The paper proposes Geo-Llama, a framework for generating realistic synthetic human mobility trajectories with spatiotemporal constraints. The key highlights are:
Textual Encoding: Trajectories are represented as sequences of textual tokens, where each visit is encoded as "arrival time is t, location is l, duration is d".
Temporal-Order Permutation: To enable the LLM to capture spatiotemporal patterns regardless of the order of visits, the textual sequences are randomly permuted before fine-tuning.
Parameter-Efficient Fine-Tuning: The pre-trained LLM is fine-tuned using a parameter-efficient technique (LoRA) on the permuted textual data.
Controlled Generation: During generation, the fine-tuned LLM can generate trajectories based on random prompts (uncontrolled) or prompts representing spatiotemporal constraints (controlled).
Integrity Check: A post-processing step is performed to ensure the generated trajectories satisfy integrity constraints, such as no overlapping visits.
The experiments on real-world and synthetic datasets demonstrate that Geo-Llama outperforms existing methods in generating realistic trajectories, both in uncontrolled and controlled settings. It also exhibits superior data efficiency, maintaining high performance with limited training data.
إلى لغة أخرى
من محتوى المصدر
arxiv.org
الرؤى الأساسية المستخلصة من
by Siyu Li, Toa... في arxiv.org 09-12-2024
https://arxiv.org/pdf/2408.13918.pdfاستفسارات أعمق