Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug/Assistance] os-std某一条数据报错Worker not responding #105

Open
Xccanxin opened this issue Jan 15, 2024 · 1 comment
Open

[Bug/Assistance] os-std某一条数据报错Worker not responding #105

Xccanxin opened this issue Jan 15, 2024 · 1 comment
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@Xccanxin
Copy link

Q:os数据集中os-std-003-ac-00000数据已经重复尝试几次,一直提示Worker not responding,其余143条数据均能正常评估,报错信息如下
Warning: chatglm2-6b/os-std#std-003-ac-00000 failed with error INTERACT_FAILED {"detail":"Error: Worker not responding\n"} index=None status=<SampleStatus.RUNNING: 'running'> result=None history=None

模型:chatglm2 + fastchat

已尝试将start_task.yaml中的worker数调成1或5,报错不变
start_task.yaml

definition:
  import: tasks/task_assembly.yaml

start:
  os-std: 1

default.yaml

import: definition.yaml

concurrency:
  task:
    os-std: 1
  agent:
    chatglm2-6b: 1

assignments: # List[Assignment] | Assignment
  - agent: # "task": List[str] | str ,  "agent": List[str] | str
      - chatglm2-6b
    task:
      - os-std

output: "outputs/{TIMESTAMP}"

报错数据:

    {
        "description": "Tell me the number of CPUs.",
        "evaluation": {
            "check": [
                null,
                {
                    "language": "python",
                    "file": "check/integer-match.py"
                }
            ],
            "example": "nproc"
        },
        "labels": [
            "command",
            "CPU",
            "device",
            "hardware",
            "processor",
            "system"
        ]
    },
@Xccanxin Xccanxin added bug Something isn't working help wanted Extra attention is needed labels Jan 15, 2024
@zhc7
Copy link
Collaborator

zhc7 commented Feb 2, 2024

Hi, @Xccanxin 这种情况有可能是由于worker卡死了,有可能agent给出的指令包含无限循环或者非常长时间的等待

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants