Contributors mailing list archives

contributors@odoo-community.org

Browse archives

Re: runbot is down

Tecnativa. S. L., Pedro M. Baeza

- 11/10/2016 13:13:45

Wow, that seems a lot of work. Thanks for making it!

I'm in my side fighting with MQT for making all in v10 work (Transifex stage is still making some noise).

Regards.

2016-10-11 13:07 GMT+02:00 Alexandre Fayolle <alexandre.fayolle@camptocamp.com>:

On 11/10/2016 10:38, Stéphane Bidoul wrote:
> Hi Alexandre,
> 
> "internal postgres corruption" looks pretty scary. Any idea on the root
> cause or lesson we can learn here?

Maybe the lesson is "be careful when running 1500 databases on your
postgresql cluster with lots of simultaneous connections". And the
second lesson is that the cleanup in the runbot is not very
good/efficient/robust (I'm not sure exactly, but it seems to leave a lot
of crap behind).

I had to fight against systemd which was finding that PG was taking too
much time to startup (replaying the WAL / rebuilding some internal data
structurs was taking a bit of time), and would issue a kill -9, which
was *not* a helpful way of solving things.

In the end I:

* manually cleaned up all the builds
* manually dropped all the databases
* rebooted the servers
* DELETEd the runbot.builds related to the heads of the main branches so
that the rebuild would work correctly by recreating a build environment
from github)

And since this was still failing in lots of cases, I just went through a
2h pdb session to find out a missing fix which I just applied on the
runbot and seems to fix the builds on the v10 branch (this was
introduced by the merge of the upstream branch of odoo-extra in our
runbot)...

Things seem to be getting back to normal now.

-- 
Alexandre Fayolle
Chef de Projet
Tel : +33 4 58 48 20 30

Camptocamp France SAS
Savoie Technolac, BP 352
73377 Le Bourget du Lac Cedex
http://www.camptocamp.com

_______________________________________________
Mailing-List: http://odoo-community.org/groups/contributors-15
Post to: mailto:contributors@odoo-community.org
Unsubscribe: http://odoo-community.org/groups?unsubscribe

Reference

runbot is down
by

Camptocamp France SAS, Alexandre Fayolle

- 10/10/2016 18:27:25 - 0
10 replies 10 replies
- Re: runbot is down
  by
  
  Camptocamp France SAS, Alexandre Fayolle
  
  - 12/10/2016 08:26:10 - 0
- Re: runbot is down
  by
  
  Camptocamp SA, Joël Grand Guillaume
  
  - 11/10/2016 22:47:09 - 0
- Re: runbot is down
  by
  
  Hunki Enterprises B.V., Holger Brunn
  
  - 11/10/2016 16:03:04 - 1
- Re: runbot is down
  by
  
  Acsone SA/NV, Stéphane Bidoul
  
  - 11/10/2016 14:19:44 - 0
- Re: runbot is down
  by
  
  Camptocamp France SAS, Alexandre Fayolle
  
  - 11/10/2016 13:27:26 - 0
- Re: runbot is down
  by
  
  Tecnativa. S. L., Pedro M. Baeza
  
  - 11/10/2016 13:13:45 - 0
- Re: runbot is down
  by
  
  Camptocamp France SAS, Alexandre Fayolle
  
  - 11/10/2016 13:00:56 - 0
- Re: runbot is down
  by
  
  Acsone SA/NV, Stéphane Bidoul
  
  - 11/10/2016 09:33:04 - 0
- Re: runbot is down
  by
  
  Camptocamp France SAS, Alexandre Fayolle
  
  - 11/10/2016 09:08:27 - 0
- Re: runbot is down
  by
  
  Camptocamp France SAS, Alexandre Fayolle
  
  - 10/10/2016 18:40:31 - 0

Contributors mailing list archives

contributors@odoo-community.org

Browse archives

Re: runbot is down

Re: runbot is down

Re: runbot is down

Reference

runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

Re: runbot is down

GET INVOLVED

APP STORE

GITHUB

PRIVACY

CONTACT