Models that write multiple functions in return to misnamed prompts are treated unfairly #14

the-crypt-keeper · 2023-06-08T18:32:50Z

Considering the following response from WizardLM 13b:

Sure! Here's an example JavaScript function fofx that takes in two numbers as arguments (x and banana) and returns their greatest common divisor using Euclid's algorithm:

function gcd(a, b) {
  if (b === 0) return a; // base case for when b is zero
  return gcd(b, a % b); // recursive call until b becomes zero
}

function fofx(x, banana) {
  const result = gcd(x, banana);
  console.log(`The GCD of ${x} and ${banana} is ${result}.`);
  return result;
}

With the current extract_function_info we'd get two functions output, but sandbox.py only ever looks at the first one so it would see gcd(a,b) as the entrypoint and fail 3 of the tests because both the function and arguments are named incorrectly - but this is clearly unfair since it did generate the correct function, and it did name the arguments correctly.

The text was updated successfully, but these errors were encountered:

the-crypt-keeper mentioned this issue Jul 8, 2023

Evaluate openchat/opencoderplus #23

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Models that write multiple functions in return to misnamed prompts are treated unfairly #14

Models that write multiple functions in return to misnamed prompts are treated unfairly #14

the-crypt-keeper commented Jun 8, 2023 •

edited

Models that write multiple functions in return to misnamed prompts are treated unfairly #14

Models that write multiple functions in return to misnamed prompts are treated unfairly #14

Comments

the-crypt-keeper commented Jun 8, 2023 • edited

the-crypt-keeper commented Jun 8, 2023 •

edited