Popular repositories
- DPMM-COT (Public, forked from shimurenhlq/DPMM-COT)
The code of paper "Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models"
Python
- LightRAG (Public, forked from HKUDS/LightRAG)
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Python
- visualwebarena (Public, forked from web-arena-x/visualwebarena)
VisualWebArena is a benchmark for multimodal agents.
Python
- SeeAct (Public, forked from OSU-NLP-Group/SeeAct)
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Python
- SeeClick (Public, forked from njucckevin/SeeClick)
The model, data and code for the visual GUI Agent SeeClick
HTML
- Mind2Web (Public, forked from OSU-NLP-Group/Mind2Web)
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
Jupyter Notebook