Answered
Hey Guys, I Keep Getting

Hey guys, I keep getting the following error for the trains-agent-services container after updating to 0.17.0, even though it is running as part of the docker-compose: trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ? Any ideas? Everything else seems to be working fine.

  
  
Posted 3 years ago

Answers 27


new version worked

  
  
Posted 3 years ago

Currently no, you need to set them to a set of credentials you created

  
  
Posted 3 years ago

well, the server wouldn't work without them?

  
  
Posted 3 years ago

I assume you've configured the TRAINS_API_ACCESS_KEY and TRAINS_API_SECRET_KEY env vars?
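To confirm from inside the container that both variables are actually set, a minimal sketch along these lines may help. The helper name `missing_credentials` is hypothetical; only the two variable names come from this thread.

```python
import os

# Variable names as mentioned in this thread.
REQUIRED_VARS = ("TRAINS_API_ACCESS_KEY", "TRAINS_API_SECRET_KEY")


def missing_credentials(env=None):
    """Return the names of required variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_credentials()
    if missing:
        print("Missing or empty:", ", ".join(missing))
    else:
        print("Both credential variables are set.")
```

This only checks that the variables are present; it does not verify that the server accepts the credentials.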

  
  
Posted 3 years ago

yeah, we did. let me check if explicitly setting credentials helps

  
  
Posted 3 years ago

Do you have any idea why the cleanup task keeps failing then? (It used to work before the update.)

  
  
Posted 3 years ago

Yeah, this looks like it did finally succeed connecting to the apiserver...

  
  
Posted 3 years ago

Nice, thanks! I'll check whether it solves the issue first thing tomorrow morning.

  
  
Posted 3 years ago

Can you share the task log?

  
  
Posted 3 years ago

Nope, the old cleanup task fails with trains_agent: ERROR: Could not find task id=e7725856e9a04271aab846d77d6f7d66 (for host: )
Exception: 'Tasks' object has no attribute 'id'

weirdly enough, curl http://apiserver:8008 from inside the container works

  
  
Posted 3 years ago

It looks like it has no host definition

  
  
Posted 3 years ago

WARNING: You are using pip version 20.1.1; however, version 20.3.3 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.

trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?

http://OUR_IP:8081 http://OUR_IP:8080 http://apiserver:8008
WARNING: You are using pip version 20.1.1; however, version 20.3.3 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.

trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?

http://OUR_IP:8081 http://OUR_IP:8080 http://apiserver:8008
WARNING: You are using pip version 20.1.1; however, version 20.3.3 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.
Failed creating temporary copy of ~/.ssh for git credential

  
  
Posted 3 years ago

Problem is solved. I had to replace /opt/trains/data/fileserver with /opt/clearml/data/fileserver in the Agent configuration, and replace trains with clearml in Requirements.
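For anyone with several tasks to fix, the two substitutions can be sketched as below. The function names are hypothetical, and the sketch assumes requirements are plain one-package-per-line entries; any version specifier is left untouched and may need a manual review after the rename.

```python
import re

# Paths taken from this thread.
OLD_PATH = "/opt/trains/data/fileserver"
NEW_PATH = "/opt/clearml/data/fileserver"

# Matches a requirement whose package name is exactly "trains",
# e.g. "trains" or "trains>=0.16", but not "trains-agent".
_TRAINS_REQ = re.compile(r"^trains(?=\s*(?:[=<>!~]|$))")


def migrate_path(text):
    """Replace the old fileserver path with the new one wherever it appears."""
    return text.replace(OLD_PATH, NEW_PATH)


def migrate_requirement(line):
    """Rename the 'trains' package to 'clearml', leaving other packages alone."""
    return _TRAINS_REQ.sub("clearml", line.strip())


if __name__ == "__main__":
    print(migrate_path("files_server: /opt/trains/data/fileserver"))
    for req in ["trains", "trains>=0.16", "trains-agent", "numpy"]:
        print(req, "->", migrate_requirement(req))
```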

  
  
Posted 3 years ago

magic

  
  
Posted 3 years ago

SuccessfulKoala55 grrrrr it keeps happening, I have no idea what's wrong

  
  
Posted 3 years ago

thank you 😃

  
  
Posted 3 years ago

Just FYI, we updated the agent services docker image a week ago (along with the docker-compose.yml), you might want to try that out

  
  
Posted 3 years ago

can you share the entire output of the agent?

  
  
Posted 3 years ago

Are you using the default docker-compose, or an AMI?

  
  
Posted 3 years ago

default docker-compose

  
  
Posted 3 years ago

docker logs trains-agent-services

  
  
Posted 3 years ago

We're thinking about changing that

  
  
Posted 3 years ago

DilapidatedDucks58 I tested again, and I'm not sure there's a problem at all. I think what you saw might be a few errors thrown while the agent-services is waiting for the apiserver to start. I just tried a fresh install and the agent-services did appear in the Workers & Queues page. If you can, I'd like to see the full output of docker logs trains-agent-services
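To tell startup noise apart from a real failure, one option is to poll the apiserver until it answers. The following is a minimal sketch, not how the agent itself behaves; `is_reachable` and `wait_for_server` are hypothetical names, and the URL is the one from the error message in this thread.

```python
import time
import urllib.error
import urllib.request


def is_reachable(url, timeout=3.0):
    """Return True if the server answers at all, even with an HTTP error status."""
    try:
        urllib.request.urlopen(url, timeout=timeout)
        return True
    except urllib.error.HTTPError:
        # The server responded, just not with a success status; it is up.
        return True
    except (urllib.error.URLError, OSError):
        return False


def wait_for_server(url, attempts=30, delay=2.0, probe=is_reachable, sleep=time.sleep):
    """Poll `url` until `probe` succeeds; return the attempt number, or None."""
    for attempt in range(1, attempts + 1):
        if probe(url):
            return attempt
        if attempt < attempts:
            sleep(delay)
    return None


if __name__ == "__main__":
    result = wait_for_server("http://apiserver:8008", attempts=5, delay=1.0)
    print("reachable after attempt", result if result else "none (gave up)")
```

If the errors stop once the server is reachable, they were most likely emitted while the apiserver was still starting.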

  
  
Posted 3 years ago

Hi DilapidatedDucks58 ,
That's strange since it's an internal docker-compose address... Let me try to reproduce... Do you have any specific change in your setup?

  
  
Posted 3 years ago

Is it the same error again? Can you send the agent services logs?

  
  
Posted 3 years ago

Hi DilapidatedDucks58 ,
So it seems there's an issue where the agent-services can't resolve http://apiserver from within its container. Changing the default TRAINS_API_HOST in the docker-compose to http://localhost:8008 does the trick - we'll update the docker-compose.
Also, it seems I forgot that by default, the agent does not require special credentials and can use the default built-in credentials that exist in the server 🤭 - this is fine as long as you keep your server closed to the world (if you open it, you'll obviously want to change all default secrets and credentials anyway).
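To check name resolution from inside the container, a small sketch such as the following can be run with the container's Python. The helper names are hypothetical; the URLs are the two mentioned in this thread.

```python
import socket
from urllib.parse import urlsplit


def host_and_port(url):
    """Split a URL into (hostname, port), falling back to the scheme default."""
    parts = urlsplit(url)
    port = parts.port or (443 if parts.scheme == "https" else 80)
    return parts.hostname, port


def resolves(url):
    """Return True if the URL's hostname can be resolved to an address."""
    host, port = host_and_port(url)
    try:
        socket.getaddrinfo(host, port)
        return True
    except socket.gaierror:
        return False


if __name__ == "__main__":
    for url in ("http://apiserver:8008", "http://localhost:8008"):
        print(url, "resolves" if resolves(url) else "does NOT resolve")
```

A successful lookup only shows that the name maps to an address; it does not prove that the apiserver is listening on that port.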

  
  
Posted 3 years ago

Also, did you use the agent-services before upgrading? (I'm trying to understand if there's a regression of some sort.)

  
  
Posted 3 years ago