
Conversation

@Atry (Contributor) commented Jan 28, 2022

According to https://aws.amazon.com/ec2/instance-types/, r6i.4xlarge has double the memory of m5.4xlarge. Hopefully it will reduce the out-of-memory errors.
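For reference, a minimal boto3 sketch (assuming AWS credentials and a default region are configured; this script is illustrative and not part of this repo) that prints the vCPU and memory figures behind that comparison:

```python
import boto3

# Look up the published specs for the old and new builder instance types.
ec2 = boto3.client("ec2")
resp = ec2.describe_instance_types(InstanceTypes=["m5.4xlarge", "r6i.4xlarge"])
for it in resp["InstanceTypes"]:
    vcpus = it["VCpuInfo"]["DefaultVCpus"]
    mem_gib = it["MemoryInfo"]["SizeInMiB"] / 1024
    print(f'{it["InstanceType"]}: {vcpus} vCPUs, {mem_gib:.0f} GiB')
# m5.4xlarge:  16 vCPUs,  64 GiB
# r6i.4xlarge: 16 vCPUs, 128 GiB
```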

@Atry (Contributor, Author) commented Jan 29, 2022

I have deployed the step functions for testing.

@Atry (Contributor, Author) commented Jan 29, 2022

I triggered a previously failed build (bin/build-on-aws 2022.01.21 debian-11-bullseye) to test this PR.

@Atry (Contributor, Author) commented Jan 29, 2022

Sorry, deploying the step functions did not take effect.

I deployed the lambdas instead, and triggered a previously failed build (bin/build-on-aws 2022.01.20 debian-11-bullseye) to test this PR.

@Atry (Contributor, Author) commented Jan 29, 2022

I triggered a previously failed build (bin/build-on-aws 2022.01.21 debian-11-bullseye) to test this PR.

The retry instance type is now r6i.4xlarge, since I have deployed the new lambdas.

[Screenshot, 2022-01-28 17:46]

@fredemmott (Contributor)

Nice; the m* family used to be the high-memory tier, and I assumed that was still the case.

Given we don't need these jobs to be super fast, we should also try smaller options in the r6i range: changing the cores:RAM ratio should fix the problem, not just adding more RAM.

It's worth trying r6i.xlarge and seeing what the build times are like.

--

As a concrete example, we were building with 32 GB until August last year (a4c60f8), when some OOMs started; doubling the cores and doubling the RAM did not fix the problem.
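To make the cores:RAM point concrete, here is a small illustration using the published specs of the instance types discussed in this thread (figures from https://aws.amazon.com/ec2/instance-types/; the script is just an aside, not part of the build):

```python
# Published specs (vCPUs, memory in GiB) for the instance types mentioned above.
instance_types = {
    "m5.4xlarge": (16, 64),    # previous builder
    "r6i.4xlarge": (16, 128),  # this PR
    "r6i.xlarge": (4, 32),     # suggested cheaper option
}
for name, (vcpus, mem_gib) in instance_types.items():
    print(f"{name}: {mem_gib / vcpus:.0f} GiB per vCPU")
# m5.4xlarge:  4 GiB per vCPU
# r6i.4xlarge: 8 GiB per vCPU
# r6i.xlarge:  8 GiB per vCPU (same ratio as r6i.4xlarge, with a quarter of the cores)
```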

@Atry (Contributor, Author) commented Jan 29, 2022

[Screenshot, 2022-01-28 19:39]

There is one retry attempt for the nightly build. That is fewer retry attempts than in previous nightly builds.

@Atry (Contributor, Author) commented Jan 29, 2022

We currently limit the concurrency via #260.

Let's revert #260 for better CPU utilization.

@fredemmott (Contributor)

> There is one retry attempt for the nightly build. That is fewer retry attempts than in previous nightly builds.

If it still needs an OOM retry, the problem still exists; with the previous settings, it sometimes succeeded with 0 retries. "Fewer retries" is an extremely low signal.

I think the next step is to enable atop to figure out what's going on; I suspect cargo may be parallelizing to NCORES in combination with HHVM's own parallelization, but we need more data.
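As an illustration of that suspicion (all per-process figures below are hypothetical, chosen only to show the shape of the problem), one-job-per-core parallelism nested inside one-job-per-core parallelism grows roughly with the square of the core count:

```python
# Back-of-the-envelope sketch: if the outer build runs one job per core and each
# embedded cargo invocation also defaults to one rustc per logical CPU, peak
# concurrency is the product of the two, so more cores can mean more OOMs even
# with more RAM.
vcpus = 16                   # m5.4xlarge and r6i.4xlarge both have 16 vCPUs
mem_gib = 128                # r6i.4xlarge memory
outer_jobs = vcpus           # e.g. make/ninja -j$(nproc)
cargo_jobs = vcpus           # cargo's default job count is the logical CPU count
mem_per_compiler_gib = 1.0   # hypothetical average per compiler process

peak = outer_jobs * cargo_jobs
print(f"worst case: {peak} compilers, ~{peak * mem_per_compiler_gib:.0f} GiB "
      f"needed vs {mem_gib} GiB available")
# worst case: 256 compilers, ~256 GiB needed vs 128 GiB available
```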

@Atry (Contributor, Author) commented Feb 15, 2022

Even though it's a weak signal, we have seen fewer failures recently. Shall we merge this PR?

@Atry requested a review from @fredemmott on February 15, 2022.
@fredemmott merged commit e4c3de7 into master on February 18, 2022.
@Atry deleted the r6i.4xlarge branch on February 18, 2022.