As a user, I want to tune my vLLM configuration using only vLLM-native settings, so that my setup remains lightweight and free from external framework dependencies.
Therefore, we need to:
- implement a local backend
- add a config parameter to switch between the Ray and local backends
- adapt the different controllers to support the local backend
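
A minimal sketch of how the backend switch could look, to make the proposal concrete. Every name here (`BackendType`, `TunerConfig`, `Backend`, `LocalBackend`, `RayBackend`, `make_backend`) is hypothetical and not taken from the existing code base, and defaulting to Ray is an assumption made to preserve current behaviour. Ray is imported lazily so that a local-only setup never needs it installed.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Any, Callable, Protocol


class BackendType(str, Enum):
    """Hypothetical values for the new config parameter."""
    RAY = "ray"
    LOCAL = "local"


@dataclass
class TunerConfig:
    # Hypothetical config field; the Ray default is an assumption.
    backend: BackendType = BackendType.RAY


class Backend(Protocol):
    """Interface the controllers would depend on instead of Ray directly."""

    def run(self, fn: Callable[..., Any], *args: Any) -> Any: ...


class LocalBackend:
    """Runs each task in the current process, with no external framework."""

    def run(self, fn: Callable[..., Any], *args: Any) -> Any:
        return fn(*args)


class RayBackend:
    """Dispatches each task as a Ray remote function."""

    def __init__(self) -> None:
        import ray  # lazy import: only required when this backend is selected

        self._ray = ray
        if not ray.is_initialized():
            ray.init()

    def run(self, fn: Callable[..., Any], *args: Any) -> Any:
        remote_fn = self._ray.remote(fn)
        return self._ray.get(remote_fn.remote(*args))


def make_backend(config: TunerConfig) -> Backend:
    """Pick the backend implementation from the config parameter."""
    if config.backend is BackendType.LOCAL:
        return LocalBackend()
    return RayBackend()


if __name__ == "__main__":
    backend = make_backend(TunerConfig(backend=BackendType.LOCAL))
    print(backend.run(lambda x: x * 2, 21))
```

With this shape, a controller would only call `backend.run(...)`, so adapting it to the local backend mostly means removing its direct Ray calls.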