Better ollama load balancing #1276

Draft: wants to merge 6 commits into base: dev

asedmammad (Contributor) commented Mar 24, 2024

Pull Request Checklist

  • Description: Briefly describe the changes in this pull request.
  • Changelog: Ensure a changelog entry following the format of Keep a Changelog is added at the bottom of the PR description.
  • Documentation: Have you updated relevant documentation?
  • Dependencies: Are there any new dependencies? Have you updated the dependency versions in the documentation?

Description

This PR introduces a load-balancing scheme in the backend's ollama app, resolving issue #1081. The strategy is selected in the .env file; RoundRobin and WeightedRoundRobin are currently supported.
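For readers unfamiliar with the two strategies, here is a minimal sketch of how they might behave. It is not the code from this PR: the class names, the `next_url` method, and the choice of the "smooth" weighted variant are assumptions made for illustration only.

```python
import itertools
from typing import Dict, List


class RoundRobin:
    """Hypothetical sketch: cycle through backend URLs in a fixed order."""

    def __init__(self, urls: List[str]) -> None:
        if not urls:
            raise ValueError("at least one backend URL is required")
        self._cycle = itertools.cycle(urls)

    def next_url(self) -> str:
        return next(self._cycle)


class WeightedRoundRobin:
    """Hypothetical sketch: smooth weighted round robin.

    Each backend is chosen in proportion to its weight, and picks for a
    heavy backend are interleaved with the others instead of bunched.
    """

    def __init__(self, weights: Dict[str, int]) -> None:
        if not weights or any(w <= 0 for w in weights.values()):
            raise ValueError("weights must be positive integers")
        self._weights = dict(weights)
        self._current = {url: 0 for url in weights}
        self._total = sum(weights.values())

    def next_url(self) -> str:
        # Raise every backend's running score by its weight.
        for url, weight in self._weights.items():
            self._current[url] += weight
        # Pick the highest score, then penalize it by the total weight.
        chosen = max(self._current, key=self._current.get)
        self._current[chosen] -= self._total
        return chosen
```

With weights `{"a": 2, "b": 1}`, the weighted picker returns `a, b, a` and then repeats. Neither class is thread-safe as written; a real backend would likely need a lock around `next_url` or a per-worker instance.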


Changelog Entry

Added

  • Implemented load balancer scheme for ollama backend

  • Added RoundRobin and WeightedRoundRobin strategies

  • Configured strategies in .env.example file for easy customization
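As a rough illustration of how settings from a .env file could be read, the sketch below parses a strategy name and per-backend weights from an environment mapping. The variable names `OLLAMA_BASE_URLS`, `OLLAMA_LB_STRATEGY`, and `OLLAMA_LB_WEIGHTS`, the semicolon separator, and the function name are all assumptions; the actual keys are whatever this PR's .env.example defines.

```python
from typing import Dict, List, Mapping, Tuple

SUPPORTED = ("RoundRobin", "WeightedRoundRobin")


def parse_lb_settings(env: Mapping[str, str]) -> Tuple[str, List[str], Dict[str, int]]:
    """Hypothetical parser; every variable name here is assumed, not taken from the PR."""
    urls = [u.strip() for u in env.get("OLLAMA_BASE_URLS", "").split(";") if u.strip()]
    strategy = env.get("OLLAMA_LB_STRATEGY", "RoundRobin")
    if strategy not in SUPPORTED:
        raise ValueError(f"unsupported strategy: {strategy}")

    raw = [w.strip() for w in env.get("OLLAMA_LB_WEIGHTS", "").split(";") if w.strip()]
    if strategy == "WeightedRoundRobin":
        # Require exactly one weight per backend so none is silently dropped.
        if len(raw) != len(urls):
            raise ValueError("one weight per URL is required")
        weights = {url: int(w) for url, w in zip(urls, raw)}
    else:
        # Plain round robin treats every backend equally.
        weights = {url: 1 for url in urls}
    return strategy, urls, weights
```

In an application this would typically be called with `os.environ` after the .env file has been loaded.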

asedmammad (Contributor, Author) commented Mar 24, 2024

I can work on adding the ability to change the strategy, weight values and other parameters directly through the UI. This would allow users to easily configure the load balancer settings without needing to modify the .env file.
