AIChat.app version 1.1
- Added local model selector dialog, trying to find GGUF model files you might have downloaded with HuggingFace or LM Studio.
- Added a HuggingFace model browser and downloader.
- Improved calculations for context size. Added settings for reduced kv cache size, making even more context size available per RAM.
- Added 10 min inactivity setting to llama-server to unload the model and free the memory - now the app can be left open in the background and will not wire the RAM permanently
- update llama-server to b9204
AIChat now requires macOS 14.6. The release is only for Apple Silicon macs (arm64). Intel mac support dropped - you cannot get a reasonable inference speeds on these machines.