
Discord? #5

Open
yankscally opened this issue Apr 25, 2023 · 5 comments
@yankscally

I know this is usually unnecessary, but I'd like to help out. Do you have a discord?

I have some ideas about datasets, training that I think could be very useful. I'm also a GDScript veteran at this point.

I've successfully and commercially used GPT2 models in 2019 - so I have experience with datasets.

@minosvasilias
Owner

Hey, any ideas and contributions are appreciated!
Would ideally like to keep conversations public in GitHub issues if possible, so this can be followed by anyone interested.

If you do want a private chat though, you can reach me on Discord at markus_#2339 or on Twitter at @minosvasilias.

@yankscally
Author

[screenshot of a conversation with the model in the webui]

Just got a chance to try out your model, and the results seem promising.

Here is a sample of a conversation I had with your model. It grasps basic GDScript terminology and syntax, but it may be set up incorrectly in my webui.

What is the best way to use this model? I am using the 7B model locally in the oobabooga webui.

@minosvasilias
Owner

The model is finetuned on an instruct dataset in the style of stanford-alpaca and similar models. This means all samples conform to a specific prompting template, which in the case of godot-dodo is:

Below is an instruction that describes a GDScript coding task. 
Write code that appropriately completes the request.


### Instruction:
{instruction}

### Response:

I have not tested using the model without that format, and am not familiar with how oobabooga sets things up.
It is very interesting to see it seemingly still retain some natural-language conversation capability, though, despite this fixed format and the very repetitive initial response token(s) (func xxx:) it is trained on.

So if you want to reproduce the model performance as I evaluated it, you will need to follow the exact prompting template above. You can do this via Google Colab using the Jupyter notebook linked in the readme: https://colab.research.google.com/github/minosvasilias/godot-dodo/blob/main/demo/inference_demo.ipynb
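For reference, here is a minimal sketch of what that could look like outside the notebook using Hugging Face transformers. The model id, sampling settings, and the helper function are illustrative assumptions rather than anything taken from this repo, so adjust them to your setup:

```python
# Minimal sketch, assuming the checkpoint is available on the Hugging Face Hub.
# The model id and sampling parameters below are assumptions; substitute your own.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "minosvasilias/godot-dodo-4x-60k-llama-7b"  # assumed id, check the readme

# Prompt template as described above; exact whitespace may differ slightly.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a GDScript coding task. "
    "Write code that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

def generate(instruction: str, max_new_tokens: int = 256) -> str:
    # Fill the godot-dodo prompt template and generate a single response.
    prompt = PROMPT_TEMPLATE.format(instruction=instruction)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        temperature=0.2,   # illustrative sampling settings
        top_p=0.95,
        do_sample=True,
    )
    # Strip the prompt tokens so only the model's response remains.
    return tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )

print(generate("Write a function that moves a KinematicBody2D toward a target position."))
```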

@yankscally
Author

[screenshot of the prompt template configured in the webui's 'character' settings]

OK! I set this up in the 'character' section, but the output still isn't full scripts. I think the problem may be in these generation parameters... still figuring it out.

[screenshot of the webui generation parameters]

@minosvasilias
Owner

minosvasilias commented Jun 4, 2023

That looks sensible, though again I'm not sure exactly how they format the context.

However, godot-dodo models are unlikely to generate full scripts anyway if you're looking for that. The training dataset is split into individual methods, and the model therefore learns to implement the instructions within the scope of a single method. It will rarely, if ever, exceed that scope.
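For intuition about that scope, each training sample pairs one instruction with one method, roughly along these lines (the field names and content below are illustrative assumptions, not actual entries from the dataset):

```python
# Illustrative sketch of a single method-level training sample.
# Field names and the example content are assumptions made for explanation only.
sample = {
    "instruction": "Return the player's health clamped between 0 and max_health.",
    "output": (
        "func get_clamped_health(health: int, max_health: int) -> int:\n"
        "\treturn clamp(health, 0, max_health)\n"
    ),
}
```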
