| ─── Robotics ─── |
|
|
|
|
|
| See, Act, Adapt: Active Perception for Unsupervised Cross-Domain Visual Adaptation via Personalized VLM-Guided Agent |
arXiv |
2026 |
image, text |
text |
|
| ─── Agentic ─── |
|
|
|
|
|
| Personal AI Agent for Camera Roll VQA |
arXiv |
2026 |
image, text |
text |
Page |
| MyPCBench: A Benchmark for Personally Intelligent Computer-Use Agents |
arXiv |
2026 |
image, text |
text |
Page, Code |
| VisualClaw: A Real-Time, Personalized Agent for the Physical World |
arXiv |
2026 |
video, image, text |
text |
Page |
| iOSWorld: A Benchmark for Personally Intelligent Phone Agents |
arXiv |
2026 |
image, text |
text |
Page, Code |
| PersonaTree: Structured Lifecycle Memory for Person Understanding in LLM Agents |
arXiv |
2026 |
text |
text |
|
| Personal Visual Memory from Explicit and Implicit Evidence |
arXiv |
2026 |
image, text |
text |
Page |
| PersonalHomeBench: Evaluating Agents in Personalized Smart Homes |
arXiv |
2026 |
image, text |
text |
|
| OmniMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory |
arXiv |
2026 |
image, text |
text |
Code |
| PEARL: Personalized Streaming Video Understanding Model |
arXiv |
2026 |
video, text |
text |
Code |
| According to Me: Long-Term Personalized Referential Memory QA |
arXiv |
2026 |
image, text |
text |
Code |
| ASTRA-bench: Evaluating Tool-Use Agent Reasoning and Action Planning with Personal User Context |
arXiv |
2026 |
text |
text |
|
| LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks |
arXiv |
2026 |
video, text |
text |
|
| PersonaMem-v2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory |
arXiv |
2025 |
text |
text |
Data |
| PersonaAgent: Bridging Memory and Action for Personalized LLM Agents |
arXiv |
2025 |
text |
text |
|
| ─── Unified Models ─── |
|
|
|
|
|
| TAMEing Long Contexts in Personalization: Towards Training-Free and State-Aware MLLM Personalized Assistant |
KDD |
2025 |
image, text |
image, text |
Code |
| UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens |
NeurIPS |
2025 |
image, text |
image, text |
Page |
| YoChameleon: Personalized Vision and Language Generation |
CVPR |
2025 |
image, text |
image, text |
Page |
| ─── Vision Language Model ─── |
|
|
|
|
|
| Personalize Your Large Vision-language Models With In-context Prompt Tuning |
ECCV |
2026 |
image, text |
text |
|
| Personal Visual Context Learning in Large Multimodal Models |
arXiv |
2026 |
video, image, text |
text |
Page |
| PersonaVLM: Long-Term Personalized Multimodal LLMs |
CVPR |
2026 |
image, text |
text |
Page, Code |
| PEARL: Personalized Streaming Video Understanding Model |
arXiv |
2026 |
video, text |
text |
Code |
| Ego: Embedding-Guided Personalization of Vision-Language Models |
arXiv |
2026 |
video, image, text |
text |
|
| Contextualized Visual Personalization in Vision-Language Models |
ICML |
2026 |
image, text |
text |
Page, Code |
| Online-PVLM: Advancing Personalized VLMs with Online Concept Learning |
arXiv |
2025 |
image, text |
text |
|
| MMPB: It's Time for Multi-Modal Personalization |
NeurIPS |
2025 |
image, text |
text |
Page |
| RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models |
NeurIPS |
2025 |
image, text |
text |
Code |
| Training-Free Personalization via Retrieval and Reasoning on Fingerprints |
arXiv |
2025 |
image, text |
text |
|
| PVChat: Personalized Video Chat with One-Shot Learning |
arXiv |
2025 |
video, text |
text |
|
| Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization |
arXiv |
2025 |
image, text |
text |
|
| Personalization Toolkit: Training Free Personalization of Large Vision Language Models |
arXiv |
2025 |
image, text |
text |
|
| Personalized Large Vision-Language Models |
arXiv |
2024 |
image, text |
text |
|
| MC-LLaVA: Multi-Concept Personalized Vision-Language Model |
arXiv |
2024 |
image, text |
text |
Code |
| Personalized Visual Instruction Tuning |
ICLR |
2025 |
image, text |
text |
|
| Retrieval-Augmented Personalization for Multimodal Large Language Models |
CVPR |
2025 |
image, text |
text |
Page, Code |
| MyVLM: Personalizing VLMs for user-specific queries |
ECCV |
2024 |
image, text |
text |
Page, Code |
| Yo'LLaVA: Your Personalized Language and Vision Assistant |
NeurIPS |
2024 |
image, text |
text |
Page, Code |
| ─── Large Language Models ─── |
|
|
|
|
|
| Evoking User Memory: Personalizing LLM via Recollection-Familiarity Adaptive Retrieval |
ICLR |
2026 |
text |
text |
|
| PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants |
ACL Findings |
2025 |
text |
text |
Paper |
| PersonaMem-v2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory |
arXiv |
2025 |
text |
text |
Data |
| Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale |
COLM |
2025 |
text |
text |
|
| Scaling Synthetic Data Creation with 1,000,000,000 Personas |
arXiv |
2024 |
text |
text |
|
| Personalized Large Language Models |
ICDMw |
2024 |
text |
text |
|
| LaMP: When Large Language Models Meet Personalization |
ACL |
2024 |
text |
text |
Page, Code |
| Learning to Predict Persona Information forDialogue Personalization without Explicit Persona Description |
ACL |
2023 |
text |
text |
|
| Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge |
AAAI |
2022 |
text |
text |
Code |
| A Personalized Dialogue Generator with Implicit User Persona Detection |
COLING |
2022 |
text |
text |
|
| Personalizing Dialogue Agents: I have a dog, do you have pets too? |
ACL |
2018 |
text |
text |
|