Perf: Optimize sending HTTP/2 frame #6337
Conversation
Prior to this change, HTTP/2 was almost 30% slower than HTTP/1.1 (over TLS) when downloading a huge file (over 1 GB).

Improvements:
- Avoid unnecessary IOBufferBlock allocation for all types of frames
- Avoid an unnecessary copy when sending DATA frames
- Adjust the IOBufferBlock size of Http2ClientSession::write_buffer

Cleanups:
- Decouple receiving and sending of HTTP/2 frames
- Remove an unnecessary SCOPED_MUTEX_LOCK
written += iobuffer->write(this->_reader->start(), read_len);
this->_reader->consume(read_len);
}
len += written;
I know that we were playing with different versions of the DATA frame writing. One version wrote to contiguous buffers and passed those along to SSL_write(). Another just passed along pointers from the original buffer (iobuffer in this case) and used the block pointers directly in the call to SSL_write(). Did you find that performance was better with the extra intermediate copy, which hopefully yields bigger blocks?
PR #5897 has the logic that was testing writing to the SSL_write directly from the iobuffer blocks.
I compared these two approaches, and the first one is much better.
The second one looks cool because of "no copy"; however, it didn't improve performance as expected.
SSL_write() is more expensive than memcpy(), so reducing the number of SSL_write() calls is the key point, IMO.
We can get rid of the memcpy() once SSL/TLS libraries provide lower-level APIs that split up SSL_write()'s functionality.
Actually, our QUIC implementation keeps the IOBufferBlock chain to avoid memcpy(). Unfortunately, we can't take the same approach here, because it uses the EVP cipher functions directly.
trafficserver/iocore/net/quic/QUICPacketPayloadProtector_openssl.cc
Lines 30 to 84 in 504cc9f
shinrich
left a comment
Looks good. I'm glad that we aren't losing the work we did on HTTP/2 performance improvements. What was the % performance comparison for the 1GB download after making these changes?
HTTP/2 performance becomes almost the same as HTTP/1.1 with this PR.
I measured the total time of downloading a 1 GB file from a local box.
Cherry-picked to the v9.0.x branch.
This is another approach to #5916.
Some features, such as adding padding, remain unimplemented as-is.