
Support for OpenELM of Apple #6868

Open · Ce-daros opened this issue Apr 24, 2024 · 10 comments
Labels: enhancement (New feature or request), good first issue (Good for newcomers)

Comments

@Ce-daros

Prerequisites

Please answer the following questions for yourself before submitting an issue.

  • I am running the latest code. Development is very rapid so there are no tagged versions as of now.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new bug or useful enhancement to share.

Feature Description

Support for OpenELM of Apple

https://huggingface.co/apple/OpenELM-3B-Instruct/tree/main

Ce-daros added the enhancement label on Apr 24, 2024
@ggerganov (Owner)

Nice to see the LLaMA* idea implemented in these models:

[image: excerpt from the referenced paper]

@joshcarp

Not sure if anyone is working on this yet, but I'm happy to pick it up.

@mertbozkir

Same for me, I can pick it up as well. I tried to upload the model to Ollama, but got NotImplementedError: Architecture 'OpenELMForCausalLM' not supported!

[image: screenshot of the error]

@joshcarp

Okay, update: it's more difficult than I first expected, and this is a new codebase for me, so it has stumped me a bit.
OpenELM defines the variables a_min and a_max for scaling the attention heads, and b_min and b_max for scaling the FFN. These are captured in their config, but you can also derive them from the equation:

[image of the equation]
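Roughly, this is how I read the layer-wise scaling; the linear interpolation, the rounding divisor of 256, and all the sizes below are my assumptions for illustration, not the exact code from modeling_openelm.py:

```python
# Illustrative sketch only: derive per-layer head counts and FFN widths from
# the a_min/a_max and b_min/b_max scaling ranges via linear interpolation.
# d_model, d_head, n_layers and the divisor are made-up example values.

def interpolate(v_min: float, v_max: float, layer: int, n_layers: int) -> float:
    # Linearly interpolate the multiplier across layers 0..n_layers-1.
    if n_layers == 1:
        return v_min
    return v_min + (v_max - v_min) * layer / (n_layers - 1)

def make_divisible(v: float, divisor: int) -> int:
    # Round to the nearest multiple of divisor so tensor shapes stay aligned.
    return max(divisor, int(round(v / divisor)) * divisor)

d_model, d_head, n_layers = 1280, 64, 16   # example sizes
a_min, a_max = 0.5, 1.0                    # attention-head scaling range
b_min, b_max = 0.5, 4.0                    # FFN scaling range

for i in range(n_layers):
    a_i = interpolate(a_min, a_max, i, n_layers)
    b_i = interpolate(b_min, b_max, i, n_layers)
    n_heads_i = max(1, int(a_i * d_model / d_head))   # per-layer query heads
    d_ffn_i = make_divisible(b_i * d_model, 256)      # per-layer FFN width
    print(f"layer {i:2d}: heads={n_heads_i}, ffn={d_ffn_i}")
```

The upshot is that, unlike LLaMA, every layer ends up with its own head count and FFN width, which is what makes the GGUF conversion awkward.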

Still attempting it, but I don't think I'm going to be done any time soon.

@Wladastic

@joshcarp
Waiting patiently to try it out, hopefully 🤞
Are you having any trouble figuring it out?

@joshcarp commented Apr 28, 2024

Yeah, so basically I can't figure out how to calculate the QKV offsets for the fused QKV tensor in every layer.
I'd be lying if I didn't admit I've just been throwing spaghetti at the wall, as shown by the git history here: https://github.com/joshcarp/llama.cpp

I'm using these as references:
https://huggingface.co/apple/OpenELM-270M/blob/main/modeling_openelm.py
https://github.com/apple/corenet/tree/main/mlx_examples/open_elm
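In case it helps, this is the kind of per-layer split I've been trying to get right. The fused [Q rows | K rows | V rows] layout and all the shapes/names below are my assumptions, not confirmed against the checkpoint:

```python
import numpy as np

# Hypothetical per-layer split of a fused QKV projection. Assumes the fused
# weight rows are laid out as [Q rows | K rows | V rows]; the head counts
# differ per layer because of the layer-wise scaling. Shapes are illustrative.

def split_qkv(qkv_weight: np.ndarray, n_q_heads: int, n_kv_heads: int, head_dim: int):
    # Fused weight shape: [(n_q_heads + 2 * n_kv_heads) * head_dim, d_model]
    q_rows = n_q_heads * head_dim
    kv_rows = n_kv_heads * head_dim
    assert qkv_weight.shape[0] == q_rows + 2 * kv_rows
    wq = qkv_weight[:q_rows]
    wk = qkv_weight[q_rows:q_rows + kv_rows]
    wv = qkv_weight[q_rows + kv_rows:]
    return wq, wk, wv

# Example: a layer with 12 query heads, 3 KV heads, head_dim 64, d_model 1280.
fused = np.zeros(((12 + 2 * 3) * 64, 1280), dtype=np.float32)
wq, wk, wv = split_qkv(fused, n_q_heads=12, n_kv_heads=3, head_dim=64)
print(wq.shape, wk.shape, wv.shape)  # (768, 1280) (192, 1280) (192, 1280)
```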

If anyone else wants to implement this, feel free.

@joshcarp

If anyone can help out: #6986

@kevinsuo

Does this work? Did anyone try this?

https://huggingface.co/LiteLLMs/OpenELM-GGUF

@Wladastic

> Does this work? Did anyone try this?
> https://huggingface.co/LiteLLMs/OpenELM-GGUF

Have you checked the files?
It specifically states that it is just a placeholder; there is no model.

@userforsource

Is it possible to add support for OpenELM?
I am curious because it can run on mobile devices with less power, though I don't know how the performance compares to gemma:2b or phi3.
