This paper introduces a novel approach to enhance document memory in large language models (LLMs) through guided learning mechanisms. We propose a document-wise memory selection framework that enables models to selectively memorize and retrieve document-specific information using learnable document representations and guidance loss functions.
- Framework: Novel document-wise memory selection mechanism
- Guidance Loss: Innovative guidance-based training approach for document memory
- Visualization: Comprehensive analysis of memory selection patterns
- Evaluation: Extensive experiments across multiple model architectures
Large language models can be enhanced with document-wise memory selection using learnable document representations and guidance loss functions to improve document memorization and retrieval capabilities.
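The mechanism above can be sketched as a small module. This is a hypothetical illustration, not the repository's implementation (see document_memories.py and utils.py for that): each document gets a learnable low-dimensional key, an activation maps a projection of the key to selection weights over a shared memory bank, and the selected memory is injected into the hidden states of one transformer layer.

```python
import torch
import torch.nn as nn

class DocumentMemorySelector(nn.Module):
    """Hypothetical sketch of document-wise memory selection.

    Assumptions (not from the repo): each document has a learnable key of
    size `key_dim`; an activation (e.g. tanh) turns a linear projection of
    the key into selection weights over `num_memories` memory vectors, and
    the weighted sum is added to the hidden states of one hooked layer.
    """

    def __init__(self, num_docs, key_dim=2, num_memories=32,
                 hidden_dim=2048, activation=torch.tanh):
        super().__init__()
        self.doc_keys = nn.Embedding(num_docs, key_dim)    # random document representations
        self.selector = nn.Linear(key_dim, num_memories)   # key -> selection logits
        self.memories = nn.Parameter(torch.randn(num_memories, hidden_dim) * 0.02)
        self.activation = activation

    def forward(self, doc_ids, hidden_states):
        # (batch, num_memories) selection weights per document
        weights = self.activation(self.selector(self.doc_keys(doc_ids)))
        # (batch, hidden_dim) selected memory, broadcast over sequence length
        selected = weights @ self.memories
        return hidden_states + selected.unsqueeze(1)
```

In this reading, `key_dim`, `key_activation`, `hook_memory_dim`, and `hook_memory_layer` from run.sh correspond to the key size, the activation, the number of memories, and the layer whose hidden states receive the selected memory.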
DocGuidanceLLM/
├── foundations/ # Model foundation implementations
│ ├── llama2.py # Llama2 model utilities
│ └── pythia.py # Pythia model utilities
├── document_memories.py # Document memory implementation
├── hook_lm.py # Language model hooking utilities
├── train_guidance.py # Main training script
├── utils.py # Utility functions and memory selection
├── wikitext.py # WikiText dataset processing
├── run.sh # Experiment runner script
├── requirements.txt # Python dependencies
└── README.md
The following models are supported in foundations/:
- Llama2: various sizes via llama2.py
- Pythia: various sizes via pythia.py
# Run the main training experiment
bash run.sh
# Or run with custom parameters
python train_guidance.py \
--lm_name pythia \
--lm_size 1b \
--num_gpus 1 \
--max_labels 10 \
--segment_length 128 \
--max_segements 10 \
--max_length 256 \
--lr 1e-3 \
--batch_size 16 \
--num_epochs 500 \
--hook_memory_dim 32 \
--hook_memory_layer 15 \
--key_dim 2 \
--key_activation tanh \
--guidance 0.1

Edit the parameters in run.sh to customize your experiments:
# --- LLM Related ---
lm_name=pythia
lm_size=1b
num_gpus=1
# --- Document Memory Related ---
key_dim=2            # dimension of the random document representation
key_activation=tanh  # inductive bias of document memory selection
hook_memory_dim=32   # number of memories
hook_memory_layer=15 # layer at which the memory is hooked
guidance=0.1         # alpha (guidance strength)

See utils.py for the implementation of memory selection.
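The `guidance` parameter weights a guidance term against the standard language-modeling loss. The sketch below is a hypothetical illustration of that combination; the toy guidance term (pushing different documents toward distinct memory selections) is an assumption, and the paper's exact formulation lives in train_guidance.py.

```python
import torch
import torch.nn.functional as F

def guided_training_loss(lm_logits, labels, selection_weights, doc_ids, guidance=0.1):
    """Hypothetical sketch: LM loss plus an alpha-weighted guidance term.

    The guidance term used here (penalizing similar selection patterns
    across different documents) is illustrative only.
    """
    lm_loss = F.cross_entropy(lm_logits.view(-1, lm_logits.size(-1)),
                              labels.view(-1))
    # Cosine similarity between per-document selection patterns.
    w = F.normalize(selection_weights, dim=-1)
    sim = w @ w.t()
    # Mask keeping only pairs of *different* documents.
    diff_doc = (doc_ids.unsqueeze(0) != doc_ids.unsqueeze(1)).float()
    guidance_loss = (sim * diff_doc).sum() / diff_doc.sum().clamp(min=1.0)
    return lm_loss + guidance * guidance_loss
```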
Visualization of ReLU Activation

Visualization of Tanh Activation
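The two activations impose different inductive biases on memory selection, which the visualizations above contrast. A minimal sketch of the difference (illustrative, not the repo's plotting code):

```python
import torch

# Assumed: selection logits produced from document keys.
logits = torch.randn(8, 32)

relu_w = torch.relu(logits)  # sparse and non-negative: memories are only added
tanh_w = torch.tanh(logits)  # dense and signed in [-1, 1]: memories can also be subtracted
```

ReLU zeroes out roughly half the logits, so each document selects a sparse subset of memories; tanh keeps every memory active with a bounded signed weight.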

If you find this work useful, please cite our paper:
@inproceedings{park2024document,
title={Memorizing Documents with Guidance in Large Language Models},
author={Park, Bumjin and Choi, Jaesik},
booktitle={Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI)},
year={2024}
}

Key dependencies include:
- PyTorch
- Transformers
