
Tool sometimes misinterprets multiple-choice questions #15

Open · timpaul opened this issue Apr 23, 2024 · 7 comments
Labels: bug (Something isn't working)

timpaul commented Apr 23, 2024

Accurately interpreting multiple-choice questions (beyond simple yes/no) is a challenge. Let's capture examples of the tool doing this successfully and unsuccessfully, to work out how we might improve its performance.

timpaul commented Apr 23, 2024

Here's a partially successful example for this image:

[image]

The different options for question 20 in the doc were correctly parsed, but the hint text was not.

timpaul commented Apr 23, 2024

Another mostly successful example from the same form as above:

[image]

The options for question 23 in the form were correctly determined, as was the fact that only one response is allowed.

The conditional date fields were not picked up, but this isn't surprising as the multiple-choice component doesn't support them.

This is a good example of where you might choose to structure this question differently in the web version anyway, using multiple pages and routing.
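
For illustration, here's one hedged sketch of that restructuring: the choice and the follow-up date sit on separate pages, joined by a routing rule. Every field name and all the wording below are hypothetical placeholders, not the tool's actual output format.

```python
# Hypothetical sketch: question 23 split across two pages with a routing
# rule, instead of a single question with conditional date fields.
pages = [
    {
        "id": "q23-choice",
        "question": "Which of these applies to you?",  # placeholder wording
        "answer_type": "single_choice",
        "options": ["Option A", "Option B"],
        # Only the answer that needs a follow-up date routes to the date page.
        "routes": {"Option A": "q23-date", "Option B": "next-section"},
    },
    {
        "id": "q23-date",
        "question": "When did this start?",  # placeholder wording
        "answer_type": "date",
        "routes": {"default": "next-section"},
    },
]
```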

timpaul commented Apr 23, 2024

Here's an example of it getting it wrong, from the same form:

[image]

It made two errors (both sketched below):

  1. It treated the hint text as the first option
  2. It assumed only one response was allowed
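
To make the failure concrete, here's a hedged sketch of the wrong and intended extractions. Again, the field names and wording are hypothetical placeholders, not the tool's actual output format.

```python
# Hypothetical sketch of the two failure modes described above.

# What the tool produced: the hint absorbed as an option, single response.
extracted = {
    "question": "Which of these apply?",  # placeholder wording
    "hint": None,
    "answer_type": "single_choice",  # wrong: renders as radios
    "options": ["Select all that apply", "Option A", "Option B"],
}

# What it should have produced: hint kept separate, multiple responses allowed.
intended = {
    "question": "Which of these apply?",
    "hint": "Select all that apply",
    "answer_type": "multiple_choice",  # right: renders as checkboxes
    "options": ["Option A", "Option B"],
}
```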

What's interesting (and frustrating) is that the question is nearly identical to this one, which was successfully parsed.

It does occasionally get it right:

[image]

timpaul changed the title from "Measure and improve performance on multiple-choice questions" to "Tool sometimes misinterprets multiple-choice questions" on Apr 23, 2024
timpaul added the bug label on Apr 23, 2024
timpaul commented Apr 23, 2024

Here's another example of a mostly successful extraction, from question 42 of this image:

[image]

The hint text isn't carried over as hint text; instead it's appended to the question title.
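
Roughly, the difference looks like this (hypothetical field names and wording again, not the tool's actual output format):

```python
# Hypothetical sketch of the question-42 failure mode: the hint survives,
# but lands in the wrong field.
extracted = {"question": "Question text. Hint text here.", "hint": None}
intended = {"question": "Question text", "hint": "Hint text here"}
```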

timpaul commented May 1, 2024

It's now getting an isolated version of this example right:

[image]

timpaul commented May 2, 2024

Another fail, from this image:

[image]

It chose checkboxes instead of radios. I wonder if I can get it to understand the difference based on the hint text?

timpaul commented May 2, 2024

Yes, I can!

[image]

This was fixed in this commit by adding the following to the description text for the answer_type object in the schema:

If any part of the question contains text like 'Tick the boxes...' it's a multiple_choice question.

I'd tried a few other variants before finding one that worked, which is interesting. I think what made it work was the confidence of the statement: saying 'if any part of the question', and stating that it is (rather than 'probably is') a multiple_choice question. It also helped to express it as a standalone sentence, rather than appending it as a clause to another sentence.

Notice that the question in the example doesn't contain the exact text that I cite in the schema, but it still matches.
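
As a rough illustration of where that sentence sits, here's a minimal sketch of an answer_type definition in a JSON-Schema-style structure. Only the quoted sentence and the answer_type name come from this thread; the surrounding structure and the enum values are assumptions.

```python
# Minimal sketch; the real schema's structure and enum values may differ.
answer_type = {
    "type": "string",
    "enum": ["text", "date", "yes_no", "single_choice", "multiple_choice"],  # assumed
    "description": (
        "The type of answer the question expects. "
        "If any part of the question contains text like 'Tick the boxes...' "
        "it's a multiple_choice question."
    ),
}
```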
