#Clone endpoint failing in UI

28 messages · Page 1 of 1 (latest)

covert cliff
#
    "errors": [
        {
            "message": "Something went wrong. Please try again later or contact support.",
            "locations": [
                {
                    "line": 1,
                    "column": 23
                }
            ],
            "extensions": {
                "code": "BAD_USER_INPUT"
            }
        },
        {
            "message": "Something went wrong. Please try again later or contact support.",
            "locations": [
                {
                    "line": 1,
                    "column": 23
                }
            ],
            "extensions": {
                "code": "BAD_USER_INPUT"
            }
        }
    ]
}```


User input,sensitive information removed:
```{"operationName":"saveEndpoint","variables":{"input":{"gpuIds":"AMPERE_48,ADA_48_PRO,-NVIDIA A40,-NVIDIA L40","gpuCount":1,"allowedCudaVersions":"","idleTimeout":5,"locations":null,"name":"myendpoint-dev (cloned)","networkVolumeId":null,"scalerType":"QUEUE_DELAY","scalerValue":4,"workersMax":1,"workersMin":0,"executionTimeoutMs":600000,"template":{"containerDiskInGb":25,"containerRegistryAuthId":"","dockerArgs":"","env":[{},{],"imageName":"my-imaage","name":"endpoint-dev (cloned)__template"}}},"query":"mutation saveEndpoint($input: EndpointInput!) {\n  saveEndpoint(input: $input) {\n    gpuIds\n    id\n    idleTimeout\n    locations\n    name\n    networkVolumeId\n    scalerType\n    scalerValue\n    templateId\n    userId\n    workersMax\n    workersMin\n    gpuCount\n    __typename\n  }\n}"}```
#

seems to be related to "allowedCudaVersions" changing in some way

ripe harbor
#

I think its better to remove allowedCudaVersions from your payload than to leave it empty

covert cliff
#

Can’t control that because it is how the UI is built

ripe harbor
#

oh damn, yeah sorry oversight from my part, I think @dusky phoenix or @young meteor can help by pinging the UI dev(s).

young meteor
#

@covert cliff @past knoll Bug? I think if you want to provide more info @covert cliff can maybe put under #1185337232517759028 as a [Bug]

past knoll
#

@covert cliff can you share endpoint id? we can try to replicate using that

covert cliff
#

6fiz1j5a45xg0u

#

Looks like only reason why is because it now adds "allowedCudaVersions" :""

#

which is invalid graphql maybe?

past knoll
#

oh empty one is not allowed, @restive bluff

covert cliff
#

But also fails when you add fyi

#

see:

#

"allowedCudaVersions":"12.1,12.2"

restive bluff
#

hm, I can't seem to replicate this on my own, can you detail what steps in the UI you take to reproduce this issue?

#

empty string works just fine for me

#

doesn't seem to be allowedCudaVersions causing the issue

#

might be that the GPU IDs you're sending aren't quite right?

past knoll
#

is it possible you reached max workers?

ripe harbor
#

You never really know what the actual problem is with GraphQL, it never gives a useful error. It is a nightmare trying to debug GraphQL payloads.

covert cliff
covert cliff
#

And here is the exact request:

#

{"operationName":"saveEndpoint","variables":{"input":{"gpuIds":"AMPERE_48,ADA_48_PRO,-NVIDIA A40,-NVIDIA L40","gpuCount":1,"allowedCudaVersions":"","idleTimeout":5,"locations":null,"name":"facerecognition-dev (cloned)","networkVolumeId":null,"scalerType":"QUEUE_DELAY","scalerValue":4,"workersMax":1,"workersMin":0,"executionTimeoutMs":600000,"template":{"containerDiskInGb":5,"containerRegistryAuthId":"clhpo24ed0008le085dfb6npb","dockerArgs":"","env":[{"__typename":"EnvironmentVariable","key":"RUNPOD_BUCKET","value":"deep-test-bucket"},{"__typename":"EnvironmentVariable","key":"CREDS_PATH","value":"/app/creds.json"}],"imageName":"zachdeepshot/deep-facerecognition-service-dev:c44a24d9806136cb2df0a7a36e4d8bcfed5a8961-serverless","name":"facerecognition-dev (cloned)__template"}}},"query":"mutation saveEndpoint($input: EndpointInput!) {\n saveEndpoint(input: $input) {\n gpuIds\n id\n idleTimeout\n locations\n name\n networkVolumeId\n scalerType\n scalerValue\n templateId\n userId\n workersMax\n workersMin\n gpuCount\n __typename\n }\n}"}

#

and response

#
    "errors": [
        {
            "message": "Something went wrong. Please try again later or contact support.",
            "locations": [
                {
                    "line": 1,
                    "column": 23
                }
            ],
            "extensions": {
                "code": "BAD_USER_INPUT"
            }
        },
        {
            "message": "Something went wrong. Please try again later or contact support.",
            "locations": [
                {
                    "line": 1,
                    "column": 23
                }
            ],
            "extensions": {
                "code": "BAD_USER_INPUT"
            }
        }
    ]
}```
#

same response here so not a gpu is issue: {"operationName":"saveEndpoint","variables":{"input":{"gpuIds":"ADA_24","gpuCount":1,"allowedCudaVersions":"","idleTimeout":5,"locations":null,"name":"facerecognition-dev (cloned)","networkVolumeId":null,"scalerType":"QUEUE_DELAY","scalerValue":4,"workersMax":1,"workersMin":0,"executionTimeoutMs":600000,"template":{"containerDiskInGb":5,"containerRegistryAuthId":"clhpo24ed0008le085dfb6npb","dockerArgs":"","env":[{"__typename":"EnvironmentVariable","key":"RUNPOD_BUCKET","value":"deep-test-bucket"},{"__typename":"EnvironmentVariable","key":"CREDS_PATH","value":"/app/creds.json"}],"imageName":"zachdeepshot/deep-facerecognition-service-dev:c44a24d9806136cb2df0a7a36e4d8bcfed5a8961-serverless","name":"facerecognition-dev (cloned)__template"}}},"query":"mutation saveEndpoint($input: EndpointInput!) {\n saveEndpoint(input: $input) {\n gpuIds\n id\n idleTimeout\n locations\n name\n networkVolumeId\n scalerType\n scalerValue\n templateId\n userId\n workersMax\n workersMin\n gpuCount\n __typename\n }\n}"}

past knoll
#

@restive bluff did we get to the bottom of this?