
DOC Improve plot_precision_recall #28967

Open: wants to merge 3 commits into main

Conversation

lucyleeow (Member)

Reference Issues/PRs

closes #18719

What does this implement/fix? Explain your changes.

  • Avoid using the term 'false positive rate', as this is a technical term meaning FP/(FP+TN), which is not accurate here. ('False discovery rate' would be more accurate, as it is FP/(FP+TP), but I've avoided using either term.) Also avoided 'false negative rate', even though this is not a technical term.
  • Avoid focusing on the 'number' of results returned; technically, the proportion of relevant results returned is what matters.
  • Moves precision/recall definitions up.
  • Removes the F1 definition; we mention it only once, never discuss it again, and it does not tie in to any other part of the example.
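The distinction drawn in the first point can be checked numerically. Below is a minimal sketch using scikit-learn's `confusion_matrix`; the toy labels are hypothetical, not taken from the example under discussion:

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Hypothetical toy labels, for illustration only.
y_true = np.array([1, 1, 1, 0, 0, 0, 0, 0])
y_pred = np.array([1, 1, 0, 1, 1, 0, 0, 0])

# For binary labels, confusion_matrix returns [[TN, FP], [FN, TP]].
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

fpr = fp / (fp + tn)  # false positive rate: FP / (FP + TN)
fdr = fp / (fp + tp)  # false discovery rate: FP / (FP + TP)
print(f"FPR = {fpr:.2f}, FDR = {fdr:.2f}")
```

With these labels the two quantities differ (FPR uses the true negatives in its denominator, FDR the true positives), which is why the two terms cannot be used interchangeably.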

Any other comments?

Happy to change wording.


github-actions bot commented May 7, 2024

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: dc103f4.


@ogrisel ogrisel left a comment


Thanks for the PR. Here is a quick suggestion but otherwise LGTM!

Comment on lines 10 to 12
measure of result relevancy, while recall is a measure of how many of the
relevant results are returned. 'Relevancy' here refers to items that are
postively labeled, true positives and false negatives.

I think we can avoid introducing the word "relevancy" and more directly state:

Suggested change
measure of result relevancy, while recall is a measure of how many of the
relevant results are returned. 'Relevancy' here refers to items that are
postively labeled, true positives and false negatives.
measure of the fraction of relevant items among actually returned items while recall
is a measure of the fraction of items that were returned among all items that should
have been returned.
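The suggested wording maps directly onto the standard formulas. As a hedged sketch (labels invented for illustration), scikit-learn's `precision_score` and `recall_score` compute exactly these two fractions:

```python
from sklearn.metrics import precision_score, recall_score

# Hypothetical labels, for illustration only.
y_true = [1, 1, 1, 1, 0, 0]  # four items should be returned
y_pred = [1, 1, 0, 0, 1, 0]  # three items are actually returned

# Precision: fraction of relevant items among returned items, TP / (TP + FP).
precision = precision_score(y_true, y_pred)  # 2 true positives out of 3 returned

# Recall: fraction of items returned among all that should have
# been returned, TP / (TP + FN).
recall = recall_score(y_true, y_pred)  # 2 true positives out of 4 relevant
```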

Member Author

Thanks!


@ArturoAmorQ ArturoAmorQ left a comment


Thanks for addressing this issue @lucyleeow. Here is just a nit but otherwise LGTM.

Comment on lines +31 to +33
both high recall and high precision, where high precision relates to low
false positives in returned results, and high recall relates to a low false negatives
in relevant results. High scores for both show that the classifier is returning

I think "fewer" is more grammatically correct in this context than "low" false positives/negatives. What do you think of a phrasing like:

High precision corresponds to fewer false positives in returned results, and high recall corresponds to fewer false negatives in relevant results.

I also feel that the word "relates" is a bit vague. We can alternatively say something similar to:

High precision can be achieved by having few false positives in the returned results, and high recall can be achieved by having few false negatives in the relevant results.
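The relationship discussed here can be checked with a short sketch (hypothetical labels, for illustration only): adding one false positive to the returned results lowers precision while leaving recall unchanged.

```python
import numpy as np
from sklearn.metrics import precision_score, recall_score

# Hypothetical ground truth, for illustration only.
y_true = np.array([1, 1, 1, 0, 0, 0])

# Two predictors: the second returns one extra (irrelevant) item.
y_strict = np.array([1, 1, 0, 0, 0, 0])  # 2 TP, 0 FP, 1 FN
y_loose = np.array([1, 1, 0, 1, 0, 0])   # 2 TP, 1 FP, 1 FN

p_strict = precision_score(y_true, y_strict)  # 2 / 2
p_loose = precision_score(y_true, y_loose)    # 2 / 3: the extra FP lowers precision
r_strict = recall_score(y_true, y_strict)     # 2 / 3
r_loose = recall_score(y_true, y_loose)       # 2 / 3: recall is unchanged
```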


Successfully merging this pull request may close these issues.

Precision-recall description improvement
3 participants