Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Requests & Voting Hub #2302

Open
guolinke opened this issue Aug 1, 2019 · 43 comments
Open

Feature Requests & Voting Hub #2302

guolinke opened this issue Aug 1, 2019 · 43 comments

Comments

@guolinke
Copy link
Collaborator

guolinke commented Aug 1, 2019

This issue is to maintain all features request on one page.

Note to contributors: If you want to work for a requested feature, re-open the linked issue. Everyone is welcome to work on any of the issues below.

Note to maintainers: All feature requests should be consolidated to this page. When there are new feature request issues, close them and create the new entries, with the link to the issues, in this page. The one exception is issues marked good first issue...these should be left open so they are discoverable by new contributors.

Call for Voting

we would like to call the voting here, to prioritize these requests.
If you think a feature request is very necessary for you, you can vote for it by the following process:

  1. got the issue (feature request) number.
  2. search the number in this issue, check the voting of it exists or not.
  3. if the voting exists, you can add 👍 to that voting
  4. if the voting doesn't exist, you can create a new voting by replying to this thread, and add the number in the it.

Discussions


Efficiency related


Effectiveness related


Distributed platform and GPU (OpenCL-based and CUDA)


Maintenance

Python package:

R package:


New features

New algorithms:

Objective and metric functions:

Python package:

Dask:

R package:

New language wrappers:

Input enhancements:

@guolinke guolinke pinned this issue Aug 1, 2019
This was referenced Aug 1, 2019
@js850
Copy link
Contributor

js850 commented Oct 23, 2022

Add support for CRLF line endings or improve documentation and error message #5508

@thomaslundgaard
Copy link

Optimisations for Apple Silicon #3606

@antaradas94
Copy link

Add parameter to control maximum group size for Lambdarank  #5053

@chopeen
Copy link

chopeen commented Dec 21, 2022

Allow training without loading full dataset into memory #5094

@chopeen
Copy link

chopeen commented Dec 21, 2022

Support different data types (when load data from Python) #3459

@szjunma
Copy link

szjunma commented Feb 21, 2023

Add support for early stopping in Dask interface #3712

@vitorpbarbosa7
Copy link

Add Earth Mover Distance as objective metric to be optimized (maximized) #1256

@tim-habitat
Copy link

Apache Arrow seems to be gaining a lot of traction in the dataframe space.
We use polars and it would be great to be able to directly create a dataset from arrow format.
Also, pandas 2.0 will have arrow as a backend later this month .

@barynton
Copy link

barynton commented Mar 7, 2023

Conan installation support #5770

@sanurielf
Copy link

Add support for Multi-output regression #524

@vladv14
Copy link

vladv14 commented Mar 24, 2023

Provide access to the bin ids and bin upper bounds of the constructed dataset #5191

@onacrame
Copy link

Consider implementation of the sketchboost algorithm for multi output/multiclass setting. The current multiclass approach is highly ineffecient as a separate tree structure is required for each class. This approach significantly improves on training time and model size by allowing a single tree structure to handle many classes.

This is already implemented in the Py-Boost library.

https://arxiv.org/pdf/2211.12858.pdf

@borchero
Copy link
Collaborator

borchero commented Jul 20, 2023

I am currently working on Apache Arrow support and will likely open a PR next week :)

Update: Implementation in #6022

@kaizhu256
Copy link

WebAssembly support (#5372)

@jane-delaney
Copy link

Support monotone constraints with quantile objective #3371

@bhvieira
Copy link

Recalculate feature importance during the update process of a tree model / Calculate Gain Importance on Test Data (#2413)

@cgoo4
Copy link

cgoo4 commented Feb 23, 2024

Add R-package support for an early-stopping min_delta as implemented in Python #4580 and referenced in #2526.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests