-
Notifications
You must be signed in to change notification settings - Fork 779
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training code #138
Comments
Yes to instanciate a fresh model from te code and train a new model (after generating the initial config(ie Load with no weights).... This is needed to create a new model ( with a new tokenization process, ie : multimodal input ) ... so the ability to select which input pre processors / feature extractors are available ... As Speech input should be auto tokenized ... from transcribed to text to token_ID... as well as the image being returned to token _IDs also , for images the processor would process the image and convert to token ID... We need only to have a text output , as later we can create a wrapper for generation of sound and for images ... using the same sound but with diffusers to generate an output. the training process should use the diffusers to learn the images as well as by the captioning its description .... hence for later generation any pre captioned image should be able to be regenerated or a representation ! ... For sound input and generation , obviously speech output is no problem as the same library for speech also outputs speech, but we also need a sound generator for our generated outputs ie generate the sound of a sparrow(bird).... (another form of diffuser).... hence we need the start location ! the training script for a code model ... ? |
Hi
Where can I find the code needed to train the initial model and produce the model files?
The text was updated successfully, but these errors were encountered: