Skip to content

Commit

Permalink
Chore: Add phi3 (#2914)
Browse files Browse the repository at this point in the history
* init

* version bump

* fix: correct template
  • Loading branch information
hahuyhoang411 committed May 16, 2024
1 parent 0436224 commit 2182599
Show file tree
Hide file tree
Showing 2 changed files with 33 additions and 1 deletion.
2 changes: 1 addition & 1 deletion extensions/inference-nitro-extension/package.json
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
{
"name": "@janhq/inference-nitro-extension",
"productName": "Nitro Inference Engine",
"version": "1.0.6",
"version": "1.0.7",
"description": "This extension embeds Nitro, a lightweight (3mb) inference engine written in C++. See https://nitro.jan.ai.\nAdditional dependencies could be installed to run without Cuda Toolkit installation.",
"main": "dist/index.js",
"node": "dist/node/index.cjs.js",
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
{
"sources": [
{
"url": "https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf/resolve/main/Phi-3-mini-4k-instruct-q4.gguf",
"filename": "Phi-3-mini-4k-instruct-q4.gguf"
}
],
"id": "phi3-3.8b",
"object": "model",
"name": "Phi-3 Mini",
"version": "1.0",
"description": "Phi-3 Mini is Microsoft's newest, compact model designed for mobile use.",
"format": "gguf",
"settings": {
"ctx_len": 4096,
"prompt_template": "<|user|>\n{prompt}<|end|>\n<|assistant|>\n",
"llama_model_path": "Phi-3-mini-4k-instruct-q4.gguf"
},
"parameters": {
"max_tokens": 4096,
"stop": ["<|end|>"]
},
"metadata": {
"author": "Microsoft",
"tags": [
"3B",
"Finetuned"
],
"size": 2320000000
},
"engine": "nitro"
}

0 comments on commit 2182599

Please sign in to comment.