[http] add http connection pool idle #5631

Closed
klarose wants to merge 6 commits into envoyproxy:master from klarose:add_conn_pool_Idle

Conversation

Contributor

@klarose klarose commented Jan 16, 2019

Description:
This adds the concept of a connection pool going "idle", implementing it for HTTP/1. A connection pool is idle if it has neither pending nor active requests. A user of a connection pool can ask it to invoke a callback when the pool goes idle; the pool maintains a list of such callbacks.

This is intended to be used to inform a mapping class when a connection pool is no longer used. The mapping class will use this information to help enforce a limit on the number of concurrent mapped connection pools. See #5337 (comment)

Risk Level: Low
Testing: Unit testing.
Docs Changes: None
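The mechanism described above can be sketched in a few lines. This is a minimal standalone model of the pattern, not the actual Envoy types; all names here are illustrative:

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <list>
#include <utility>

// Illustrative sketch: a pool tracks pending/active requests and fires
// every registered callback when both counts drop to zero.
class ConnPoolSketch {
public:
  using IdleCb = std::function<void()>;

  void addIdleCallback(IdleCb cb) { idle_callbacks_.push_back(std::move(cb)); }

  void onRequestStarted() { ++active_requests_; }

  void onRequestFinished() {
    --active_requests_;
    checkForIdle();
  }

private:
  // Invoke callbacks once the pool has neither pending nor active requests.
  void checkForIdle() {
    if (active_requests_ == 0 && pending_requests_ == 0) {
      for (const auto& cb : idle_callbacks_) {
        cb();
      }
    }
  }

  uint64_t active_requests_{0};
  uint64_t pending_requests_{0};
  std::list<IdleCb> idle_callbacks_;
};
```

A consumer (such as the mapping class mentioned above) would register a callback and react when the pool hands itself back.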

We want to know when a connection pool is no longer processing requests.
This adds a callback mechanism so consumers may be informed when a
connection pool transitions into this state.

Signed-off-by: Kyle Larose <kyle@agilicus.com>
Contributor Author

klarose commented Jan 16, 2019

My next pull request will implement this for http2. I've left it out of this PR so I can get feedback on the overall approach early, and to keep the PR small.

My plan for http2 is to use the following logic to decide whether the pool is idle:

   (!primary || primary->numActiveRequests() == 0) &&
   (!draining || draining->numActiveRequests() == 0) &&
   pending_requests.empty()

After that I'll implement the mapping class, then hook it in.
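That idle condition can be expressed as a small predicate. The sketch below uses a stand-in Client type rather than the real http2 pool code (a missing client counts as idle thanks to short-circuit evaluation):

```cpp
#include <cassert>
#include <cstdint>

// Stand-in for the http2 active client; the real type differs.
struct Client {
  uint64_t active_requests_{0};
  uint64_t numActiveRequests() const { return active_requests_; }
};

// The pool is idle when neither the primary nor the draining client has
// active requests, and no requests are pending.
bool poolIsIdle(const Client* primary, const Client* draining, bool pending_empty) {
  return (primary == nullptr || primary->numActiveRequests() == 0) &&
         (draining == nullptr || draining->numActiveRequests() == 0) && pending_empty;
}
```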

@lizan lizan self-assigned this Jan 16, 2019
Comment thread source/common/http/http2/conn_pool.h Outdated
Comment thread source/common/http/http1/conn_pool.cc Outdated
Comment thread source/common/http/http1/conn_pool.h Outdated
std::list<ActiveClientPtr> ready_clients_;
std::list<ActiveClientPtr> busy_clients_;
std::list<DrainedCb> drained_callbacks_;
std::list<IdleCb> idle_callbacks_;
Member

It might be overkill, but did you consider using Common::CallbackManager?

Contributor Author

Makes sense. I replaced idle_callbacks_ with that. I considered replacing drained_callbacks_ too, but it has some logic dependent on whether the list is empty, and that isn't exposed by Common::CallbackManager. That can probably be added in another PR.
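For reference, the core pattern behind a callback manager can be modeled in a few lines. This is a simplified standalone sketch, not the actual Common::CallbackManager (the real Envoy class also hands back removable handles and supports callback arguments):

```cpp
#include <cassert>
#include <functional>
#include <list>
#include <utility>

// Simplified sketch of the callback-manager pattern: register callbacks,
// then run them all when the interesting event occurs.
class CallbackManagerSketch {
public:
  using Cb = std::function<void()>;

  void add(Cb cb) { callbacks_.push_back(std::move(cb)); }

  void runCallbacks() {
    for (const auto& cb : callbacks_) {
      cb();
    }
  }

private:
  std::list<Cb> callbacks_;
};
```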

Returning the same decoder caused problems.

Signed-off-by: Kyle Larose <kyle@agilicus.com>
Signed-off-by: Kyle Larose <kyle@agilicus.com>
Signed-off-by: Kyle Larose <kyle@agilicus.com>
Signed-off-by: Kyle Larose <kyle@agilicus.com>
We were actually running a function called from a parent class. Whoops.

Signed-off-by: Kyle Larose <kyle@agilicus.com>
Member

@lizan lizan left a comment

LGTM modulo one nit.

// Http::ConnPoolImplBase
void checkForDrained() override;
void checkForIdle() override { /* TODO(klarose): implement */ }
Member

nit: NOT_IMPLEMENTED_GCOVR_EXCL_LINE

Contributor Author

This is actually being invoked by the base class when a pending request is cancelled. I had that in originally, but it failed because a unit test hit this line and panicked.

@lizan lizan added the waiting label Jan 17, 2019
@mattklein123 mattklein123 self-assigned this Jan 17, 2019
Member

@mattklein123 mattklein123 left a comment

A question to get started. Thank you.

request.removeFromList(pending_requests_);
host_->cluster().stats().upstream_rq_cancelled_.inc();
checkForDrained();
checkForIdle();
Member

Is it possible to merge checkForDrained() and checkForIdle() into a single function? I see there is some boolean logic below, but I think it should be possible and would make the code less fragile. In general, AFAICT, drained and idle are basically the same thing, and the intention of the callback is the same. Could we just use a single callback system with a different registration callback for your purpose, and not have this new callback type at all?

Contributor Author

Are you thinking of a callback object with something like onDrained vs onIdle?

Then, the checkForX function would be along the lines of:

if (drainedCondition):
  for each callback:
    callback.onDrained()
if (idleCondition):
  for each callback:
    callback.onIdle()

Member

What I'm asking is whether this change is needed. AFAICT idle and drained are the same thing. Can't you just register for a drained callback in the place where you want to do something when it's idle?

Contributor Author

@klarose klarose Jan 17, 2019

My concern is that adding a drained callback has side effects -- namely, when the pool goes idle, it will actively close all of its upstream connections. My vision here is for the upstreams to stay active until we need to actually free up connection pools for other hash key values.

But, in the short term, it's probably not a big deal. I had planned on allocating and freeing them as necessary in the first iteration. But, now that I think about it, a better approach may be to only register the drained callback when there is pressure from the constraints.

My original thought was along these lines:

ConnectionPool* assignNewPool(hash) {
  pool = getActivePool(hash);
  if (pool) { return pool; }

  pool = getIdlePool();
  if (!pool) { pool = allocateNewPoolIfPossible(); }
  if (!pool) {
    // out of resources
    return nullptr;
  }

  pool->addIdleCallback([&]() { this->poolIdle(hash, pool); });
  return pool;
}

void poolIdle(hash, pool) {
  moveFromActiveToIdle(hash, pool);
}

I was worried that the act of registering a drained callback would be problematic. But, maybe it's not. Here's an alternative:

ConnectionPool* assignNewPool(hash) {
  pool = getActivePool(hash);
  if (pool) { return pool; }

  pool = getIdlePool();
  if (!pool) { pool = allocateNewPoolIfPossible(); }
  if (!pool) {
    for each activePool {
      activePool->addDrainedCallback([&]() { this->poolIdle(hash, activePool); });
    }
    // at this point, any pools which are currently idle will have invoked poolIdle.
    return getIdlePool();
  }

  return pool;
}

void poolIdle(hash, pool) {
  moveFromActiveToIdle(hash, pool);
}

I think I'll park this idle concept for now, and try what I've just suggested.
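The second sketch depends on one key behavior: a pool that is already idle invokes a newly added drained callback immediately, so getIdlePool() finds something on the retry. That behavior can be modeled in isolation (all names here are illustrative, not the Envoy API):

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Illustrative model of a pool's drained-callback registration: if the
// pool is already idle when the callback is added, it fires right away;
// otherwise it would fire later, when the pool actually drains.
struct PoolSketch {
  bool idle{false};
  std::function<void()> drained_cb;

  void addDrainedCallback(std::function<void()> cb) {
    drained_cb = std::move(cb);
    if (idle) {
      drained_cb();
    }
  }
};
```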

Contributor Author

One thing I like a bit better about my "new" approach is that it could allow for some hysteresis on the draining: start cleaning up pools when we cross an upper threshold, before we run out, and stop cleaning them up when we drop below a lower one. This would require the ability to remove a drained callback, but we'll probably want that anyway.

Member

I see what you are saying. I guess I would say two things:

  1. I would probably start simple and just use the drained callback for now; we can optimize later if needed.
  2. If we do need to optimize, I think the implementation can internally share the same idle/drained check logic and just use a boolean to determine whether to close active connections. That would reduce the logic duplication.

WDYT?
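The second suggestion could look roughly like this. It is a hedged standalone sketch with illustrative names and fields, not the actual Envoy implementation:

```cpp
#include <cassert>

// Illustrative model: one shared check fires the registered callbacks
// whenever the pool is quiescent, and a flag decides whether idle
// upstream connections are also torn down (drain) or kept warm (idle).
struct SharedCheckPool {
  int open_connections{2};
  int active_requests{0};
  int callbacks_fired{0};

  void checkForIdle(bool close_connections) {
    if (active_requests != 0) {
      return; // Not quiescent yet; nothing to do.
    }
    ++callbacks_fired; // Stand-in for running the registered callbacks.
    if (close_connections) {
      open_connections = 0; // Drain semantics: close idle upstreams.
    }
  }
};
```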

Contributor Author

Agreed. I'll close this.

Contributor Author

klarose commented Jan 17, 2019

As discussed above, we're going to try an approach that does not require this.

@klarose klarose closed this Jan 17, 2019