feat(docs): RFC on Searching for Focus Component on UI #469

LaPetiteSouris · 2023-08-17T11:49:07Z

What kind of change does this PR introduce?

An RFC in attempt to solve #421 and to provide a direction for #442

Summary

When working on #421, I realize that there are a few missing pieces of the puzzle in an attempt to evaluate a prediction of the models. Mainly, we do not have yet the definition of "how good and useful" a prediction is. Without this, it is hard to evaluate and see if a model provides correct prediction or not.

Somehow, this problem is entangled with the fact that we need to come up with a prompt strategy and a strategy to summarize a window data into useful hints for the models.

This PR provides a baseline for discussion on how we can decompose the windows into useful elements which later can serve not only model evaluation process, but also potentially prompt building as well as to some extend, RFLHF process.

Checklist

My code follows the style guidelines of OpenAdapt
I have performed a self-review of my code
If applicable, I have added tests to prove my fix is functional/effective
I have linted my code locally prior to submission
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation (e.g. README.md, requirements.txt)
New and existing unit tests pass locally with my changes

How can your code be run and tested?

The core RFC file should be reviewed/discussed. No code is written yet at the moment

Other information

Not sure where to put the RFC file, hence I created it in openadapt.docs.
It is more convenient to just read the RFC file

* Add 2 columns in screenshot table to store png_diff_data and png_mask_diff_data. * CRUD now supports calculation and save screenshots diff data on the flight.

* SAVE_SCREENSHOT_DIFF indicates that 2 neighbors screenshot will be compared and the difference will be saved to db

abrichr · 2023-08-28T15:20:18Z

Thank you @LaPetiteSouris ! As per our conversation last week, this sounds directionally correct. Before implementing, can you please coordinate with @FFFiend to parallelize the work? 🙏

The first step of the problem formulation is:

At a single time step, given an action and a tree, find the target in the tree

@FFFiend can you please suggest some next steps once this has been implemented?

LaPetiteSouris · 2023-08-28T16:39:41Z

Thanks

With pleasure. This week I'll come up with a kind of skeleton code proposals to make sure the this is compatible with future fine-tuning, completion provider. If @FFFiend has time, please be one of the reviewers for that coming code/skeleton proposal for the next steps.

@FFFiend how far are you off to finish FineTuning. Can I wrap up another round of review ?

I would love to review CompletionProvider as well when it is available.

FFFiend · 2023-08-28T19:15:07Z

Thanks

With pleasure. This week I'll come up with a kind of skeleton code proposals to make sure the this is compatible with future fine-tuning, completion provider. If @FFFiend has time, please be one of the reviewers for that coming code/skeleton proposal for the next steps.

@FFFiend how far are you off to finish FineTuning. Can I wrap up another round of review ?

I would love to review CompletionProvider as well when it is available.

#379 is ready for review. All done. Fine-tuning (#453 is too, however based on the discussion here and our steps moving forward, I think that PR is an example of a failure mode of fine-tuning, i.e showing the performance of the Davinci GPT model on the basic bare-bones iteration of Action, Window pairs)

LaPetiteSouris added 11 commits July 9, 2023 17:21

feat(crud): Compute and save screenshot diff

e2d30df

* Add 2 columns in screenshot table to store png_diff_data and png_mask_diff_data. * CRUD now supports calculation and save screenshots diff data on the flight.

feat(config): Add SAVE_SCREENSHOT_DIFF environment variable

cb14ec3

* SAVE_SCREENSHOT_DIFF indicates that 2 neighbors screenshot will be compared and the difference will be saved to db

Merge remote-tracking branch 'upstream/main'

f5047e8

Merge remote-tracking branch 'upstream/main'

35fb336

Merge remote-tracking branch 'upstream/main'

27db3ec

Merge remote-tracking branch 'upstream/main'

c6712b6

feat(crud): add missing import after merge

09281c1

refactor(crud): add missing type annotations

a7f6542

refactor(crud): add missing type annotations

79a8cce

Merge remote-tracking branch 'upstream/main'

05d9999

feat(docs) RFC on model evaluation

fcc6543

LaPetiteSouris mentioned this pull request Sep 14, 2023

feat: UI structured representation #495

Draft

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(docs): RFC on Searching for Focus Component on UI #469

feat(docs): RFC on Searching for Focus Component on UI #469

LaPetiteSouris commented Aug 17, 2023

abrichr commented Aug 28, 2023

LaPetiteSouris commented Aug 28, 2023 •

edited

FFFiend commented Aug 28, 2023

feat(docs): RFC on Searching for Focus Component on UI #469

Are you sure you want to change the base?

feat(docs): RFC on Searching for Focus Component on UI #469

Conversation

LaPetiteSouris commented Aug 17, 2023

abrichr commented Aug 28, 2023

LaPetiteSouris commented Aug 28, 2023 • edited

FFFiend commented Aug 28, 2023

LaPetiteSouris commented Aug 28, 2023 •

edited