{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":764401124,"defaultBranch":"main","name":"llm-inference","ownerLogin":"OpenCSGs","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2024-02-28T02:15:07.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/153507210?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1712729827.0","currentOid":""},"activityList":{"items":[{"before":"05c1aa456c66fe213a0a20be8cb852c234e5e73b","after":"4ea759902cdaa07f37adc0510d13e5505afa1136","ref":"refs/heads/main","pushedAt":"2024-05-17T07:12:30.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"address apiserver to standalone file (#142)","shortMessageHtmlLink":"address apiserver to standalone file (#142)"}},{"before":"8819bc2dc1bc0820613f1fa19d35c77ef215b75c","after":"05c1aa456c66fe213a0a20be8cb852c234e5e73b","ref":"refs/heads/main","pushedAt":"2024-05-17T07:11:43.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"upgrade vllm to v0.4.1 (#143)","shortMessageHtmlLink":"upgrade vllm to v0.4.1 (#143)"}},{"before":"e3261de652f3784514a28425a46666462a70ee15","after":"8819bc2dc1bc0820613f1fa19d35c77ef215b75c","ref":"refs/heads/main","pushedAt":"2024-05-14T02:44:10.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"Update opt-125m default autoscaler parameters to help understand for end user (#138)\n\n* Update default autoscaler parameters to help understand\r\n\r\n* support set resource and scale for api server\r\n\r\n* update\r\n\r\n* update","shortMessageHtmlLink":"Update opt-125m default autoscaler parameters to help understand for …"}},{"before":"b141d5501af14415e46a365c77ce982bbdf68a64","after":"e3261de652f3784514a28425a46666462a70ee15","ref":"refs/heads/main","pushedAt":"2024-05-14T02:41:11.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"add description for TODO to avoid lost the context (#141)","shortMessageHtmlLink":"add description for TODO to avoid lost the context (#141)"}},{"before":"5c82751595b3d1e35d22e064394e56e2f5e05d36","after":"b141d5501af14415e46a365c77ce982bbdf68a64","ref":"refs/heads/main","pushedAt":"2024-05-14T02:25:28.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"fix ui concurrency setting (#140)","shortMessageHtmlLink":"fix ui concurrency setting (#140)"}},{"before":"ac8782fc55f0b2bddd6aa697f70985a407430161","after":"5c82751595b3d1e35d22e064394e56e2f5e05d36","ref":"refs/heads/main","pushedAt":"2024-05-14T01:45:28.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"upgrade to pydantic v2 -_- (#139)","shortMessageHtmlLink":"upgrade to pydantic v2 -_- (#139)"}},{"before":"832e172ee1182c1b5ea568840c0347495422dd5a","after":"ac8782fc55f0b2bddd6aa697f70985a407430161","ref":"refs/heads/main","pushedAt":"2024-05-08T09:07:54.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"Update files of auto scaling on k8s (#136)\n\n* update image\r\n\r\n* update ray cluster yaml\r\n\r\n* Update files for deploy on k8s for autoscaler","shortMessageHtmlLink":"Update files of auto scaling on k8s (#136)"}},{"before":"035c862282c35843653cd57e8c207d6afe627447","after":"832e172ee1182c1b5ea568840c0347495422dd5a","ref":"refs/heads/main","pushedAt":"2024-05-08T03:36:48.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"Serve run in thread (#135)","shortMessageHtmlLink":"Serve run in thread (#135)"}},{"before":"d7efb05e7bfc2934bf897546a98f6989437f92da","after":"035c862282c35843653cd57e8c207d6afe627447","ref":"refs/heads/main","pushedAt":"2024-05-08T00:27:39.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"Auto load models when api server start (#133)\n\n* Auto load models when api server start\r\n\r\n* update\r\n\r\n* update version\r\n\r\n* fix version conflict\r\n\r\n* update version\r\n\r\n* Update model scaling config\r\n\r\n* update version","shortMessageHtmlLink":"Auto load models when api server start (#133)"}},{"before":"5314a152c5123bcad650377d81a0156522a51a70","after":"d7efb05e7bfc2934bf897546a98f6989437f92da","ref":"refs/heads/main","pushedAt":"2024-05-06T06:47:27.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"upgrade ray 2.20.0 (#131)","shortMessageHtmlLink":"upgrade ray 2.20.0 (#131)"}},{"before":"d1b46683b33c447ae5a925986788f2812f3419a4","after":"5314a152c5123bcad650377d81a0156522a51a70","ref":"refs/heads/main","pushedAt":"2024-05-06T01:19:42.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"lock vllm and xformers version to fix conflict (#129)\n\n* lock vllm and xform version to fix conflict\r\n\r\n* add log to help debug","shortMessageHtmlLink":"lock vllm and xformers version to fix conflict (#129)"}},{"before":"1218017f30c824847f04c9384687259d31aed6f3","after":"d1b46683b33c447ae5a925986788f2812f3419a4","ref":"refs/heads/main","pushedAt":"2024-04-30T05:37:53.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"Add csg wukong model (#127)\n\n* add csg wukong model\r\n\r\n* update","shortMessageHtmlLink":"Add csg wukong model (#127)"}},{"before":"28e7e8392d7a1fb3345485e28deca56942f52b5a","after":"1218017f30c824847f04c9384687259d31aed6f3","ref":"refs/heads/main","pushedAt":"2024-04-29T05:52:51.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"enable tensor paramlism for deepspeed (#126)","shortMessageHtmlLink":"enable tensor paramlism for deepspeed (#126)"}},{"before":"cc10a27408206a26dc27ca452bf057264b1d7c43","after":"28e7e8392d7a1fb3345485e28deca56942f52b5a","ref":"refs/heads/main","pushedAt":"2024-04-28T23:42:14.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"add api in openai style (#125)","shortMessageHtmlLink":"add api in openai style (#125)"}},{"before":"2ed6bbf7665a211f229b0b56589001a83fddd6bb","after":"cc10a27408206a26dc27ca452bf057264b1d7c43","ref":"refs/heads/main","pushedAt":"2024-04-25T14:53:45.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"correct generated metric (#124)","shortMessageHtmlLink":"correct generated metric (#124)"}},{"before":"21bf28d8cba389f6658ad430bce7ea9a789c6f37","after":"2ed6bbf7665a211f229b0b56589001a83fddd6bb","ref":"refs/heads/main","pushedAt":"2024-04-24T06:11:16.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"avoid to invoke hf to speed up deployment process (#121)","shortMessageHtmlLink":"avoid to invoke hf to speed up deployment process (#121)"}},{"before":"707f1a2933e77f19b044bd08d3c28bf2a3aff246","after":"21bf28d8cba389f6658ad430bce7ea9a789c6f37","ref":"refs/heads/main","pushedAt":"2024-04-24T06:10:31.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"add llama3-8b from csghub (#122)","shortMessageHtmlLink":"add llama3-8b from csghub (#122)"}},{"before":"6ae456a295191c98cde86adf2a6061899f7ec964","after":"707f1a2933e77f19b044bd08d3c28bf2a3aff246","ref":"refs/heads/main","pushedAt":"2024-04-23T10:50:42.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"Load path model issue (#119)\n\n* baside \"transformer auto\", other integration cannot address local path of model\r\n\r\n* load model cannot work","shortMessageHtmlLink":"Load path model issue (#119)"}},{"before":"8845ee13f6ec0570e5b44ca30d366da02091b034","after":"6ae456a295191c98cde86adf2a6061899f7ec964","ref":"refs/heads/main","pushedAt":"2024-04-23T03:12:02.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"the pipeline integration cannot address pad_token/eos_token absent (#118)","shortMessageHtmlLink":"the pipeline integration cannot address pad_token/eos_token absent (#118"}},{"before":"5cc8a0aa7c76aa7ba6b3533dbd762546b53bb067","after":"8845ee13f6ec0570e5b44ca30d366da02091b034","ref":"refs/heads/main","pushedAt":"2024-04-23T01:04:26.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"support vllm on-fly generate params (#115)","shortMessageHtmlLink":"support vllm on-fly generate params (#115)"}},{"before":"4643962d75ce1ad9cf431f85d61fae00d7c7dc40","after":"5cc8a0aa7c76aa7ba6b3533dbd762546b53bb067","ref":"refs/heads/main","pushedAt":"2024-04-22T02:16:00.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"depenglee1707","name":"pengcaca","path":"/depenglee1707","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154987252?s=80&v=4"},"commit":{"message":"update doc for load model from local path (#114)","shortMessageHtmlLink":"update doc for load model from local path (#114)"}},{"before":"178dac7abfaafa5369d5bec6cdd6b52822f24803","after":"4643962d75ce1ad9cf431f85d61fae00d7c7dc40","ref":"refs/heads/main","pushedAt":"2024-04-22T01:06:34.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"Resolve merge conflict by incorporating both suggestions (#113)","shortMessageHtmlLink":"Resolve merge conflict by incorporating both suggestions (#113)"}},{"before":"8a267a69b8d10628d475fbab7e0e426b936aef22","after":"178dac7abfaafa5369d5bec6cdd6b52822f24803","ref":"refs/heads/main","pushedAt":"2024-04-21T02:37:56.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"change default static batch setting (#112)","shortMessageHtmlLink":"change default static batch setting (#112)"}},{"before":"011328103c76ce42bc685f52c27ca2618f41eb43","after":"8a267a69b8d10628d475fbab7e0e426b936aef22","ref":"refs/heads/main","pushedAt":"2024-04-19T01:32:19.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"single prompt will failed in streming (#111)","shortMessageHtmlLink":"single prompt will failed in streming (#111)"}},{"before":"7c62e51139e1c8bc2009c674ade2903974dd364d","after":"011328103c76ce42bc685f52c27ca2618f41eb43","ref":"refs/heads/main","pushedAt":"2024-04-18T13:07:14.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"simplify readme (#110)","shortMessageHtmlLink":"simplify readme (#110)"}},{"before":"03a70fc5121ced216e59b6ac2a8b579a5c2f832b","after":"7c62e51139e1c8bc2009c674ade2903974dd364d","ref":"refs/heads/main","pushedAt":"2024-04-18T13:05:55.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"recover the text-classification and summarization downstream task supporting (#109)","shortMessageHtmlLink":"recover the text-classification and summarization downstream task sup…"}},{"before":"9d8f3a008df02a76dd399df95e1cc045db6c6452","after":"03a70fc5121ced216e59b6ac2a8b579a5c2f832b","ref":"refs/heads/main","pushedAt":"2024-04-17T10:19:41.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"question-answer downstream task not work since the input-output format wrong (#108)","shortMessageHtmlLink":"question-answer downstream task not work since the input-output forma…"}},{"before":"0e1bd00a96e4ebee3556af91830b45a0a9cb3cc4","after":"9d8f3a008df02a76dd399df95e1cc045db6c6452","ref":"refs/heads/main","pushedAt":"2024-04-17T10:18:25.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"translation model broken since wrong handling of its output (#105)","shortMessageHtmlLink":"translation model broken since wrong handling of its output (#105)"}},{"before":"e953497f9263d5a69a94013e9a331024395e6687","after":"0e1bd00a96e4ebee3556af91830b45a0a9cb3cc4","ref":"refs/heads/main","pushedAt":"2024-04-17T10:17:51.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"fix llamacpp(gguf) broked by \"revision\" (#107)","shortMessageHtmlLink":"fix llamacpp(gguf) broked by \"revision\" (#107)"}},{"before":"6c90024d6b6d52efef53c4371f63cdec25e6224a","after":"e953497f9263d5a69a94013e9a331024395e6687","ref":"refs/heads/main","pushedAt":"2024-04-17T10:17:15.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"SeanHH86","name":null,"path":"/SeanHH86","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/154984842?s=80&v=4"},"commit":{"message":"Refine model config yamls (#106)\n\n* refine models\r\n\r\n* refine yamls for models","shortMessageHtmlLink":"Refine model config yamls (#106)"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAETJ8CMwA","startCursor":null,"endCursor":null}},"title":"Activity · OpenCSGs/llm-inference"}