new-audit(unminified-javascript): detect savings from minifcation by patrickhulce · Pull Request #3950 · GoogleChrome/lighthouse

patrickhulce · 2017-11-29T20:01:54Z

addresses js part of #3459 using the 3rd approach outlined

strategy: estimate minification savings by determining the ratio of the length of js tokens to overall string length

this was surprisingly accurate at identifying if a script was already minified, but is only ~6 lines using esprima and takes ~30ms/MB to tokenize compared to ~2000ms/MB for uglify and even longer for babel-minify

below is a table outlining the observed savings, w/gzip denotes the % savings after accounting for gzip, which is usually lower because minification tends to remove things that compress well

library	uglify	uglify w/gzip	babel-minify	babel-minify w/gzip	our estimate	old calculation
angular	86.1%	80.3%	85.8%	80.3%	74.5%	`(74.5 + 88.8 ) / 2`
caltrainschedule.io	--	--	50%	50%	31.3%	`(31.3 + 63.1) / 2`
react	71.4%	50%	71.4%	50%	49.4%	`(49.4% + 75.2%) / 2`
lodash	86.3%	75%	86.3%	70.8%	73.3%	`(73.3% + 87.9%) / 2`
jquery	66.6%	60%	66.6%	60%	49.1%	`(49.1% + 74.4%) / 2`
modernizr	76.9%	50%	76.9%	50%	67.9%	`(67.9% + 82.1%) / 2`

identifying the already minified scripts is as easy as checking if the savings is low as no production minified script had more than 5% savings

if this all looks good, I'll go ahead and add tests, remove WIP 👍

relevant code section for estimating minification is

lighthouse/lighthouse-core/audits/byte-efficiency/unminified-javascript.js

Lines 34 to 42 in 321df67

    
           let tokenLength = 0; 
        
           let tokenLengthWithMangling = 0; 
        
           const tokens = esprima.tokenize(scriptContent); 
        
           for (const token of tokens) { 
        
             tokenLength += token.value.length; 
        
             // assume all identifiers could be reduced to a single character 
        
             tokenLengthWithMangling += token.type === 'Identifier' ? 1 : token.value.length; 
        
           }

patrickhulce · 2017-11-29T20:20:42Z

results from cnn.com flag some unminified js from turner cdn, and possibly a bug of theirs since the files even have "ugly" in the name 😆

addyosmani · 2017-12-06T17:52:49Z

I was surprised how close the margin of error was using your simplified lexer for minified savings here. Nice work, @patrickhulce!

paulirish · 2017-12-06T18:04:27Z

lighthouse-core/audits/byte-efficiency/unminified-javascript.js

+      tokenLengthWithMangling += token.type === 'Identifier' ? 1 : token.value.length;
+    }
+
+    if (1 - tokenLength / contentLength < IGNORE_THRESHOLD_IN_PERCENT) return null;


let's add a comment to indicate this is for handling pre-minified code. \o/

patrickhulce · 2017-12-08T17:56:10Z

aw, it finally happened :(

paulirish

nice. The change in fd547cd was the big tweak I think this needed, making it more conservative and reducing our chance of false positives.

Naturally this does increase our bundle size? bundlesize isn't describing how much though.. do you know what the delta is? (Just want to have a record of it)

paulirish · 2017-12-18T19:22:11Z

lighthouse-core/audits/byte-efficiency/unminified-javascript.js

+    };
+  }
+
+  /**


can you add a comment here explaining the basic approach? using the description from this PR works for me.

let's also call out that inline scripts are not evaluated. (i dont think it matters in practice, but since other audits of ours include inline scripts we should just be explicit)

yeah sounds good 👍

done

paulirish · 2017-12-18T19:31:07Z

lighthouse-core/gather/gatherers/scripts.js

+            .catch(_ => null)
+            .then(content => {
+              if (!content) return;
+              scriptContentMap.set(record.url, content);


since we're definitely dealing with networkRecords i'd rather be using requestIds here as the key.

over in the audit we could could then use WebInspector.NetworkLog.requestForId() to grab the request and pull the URL off that. wdyt?

i suppose this'll make testing slightly harder, so curious what you think.

yeah that's fair, if same URL was requested multiple times with different content we should surface that 👍

paulirish · 2017-12-20T02:30:05Z

lighthouse-core/audits/byte-efficiency/unminified-javascript.js

 const esprima = require('esprima');

-const IGNORE_THRESHOLD_IN_PERCENT = .1;
+const IGNORE_THRESHOLD_IN_PERCENT = 10;


lol looks like past audits use a mix:

would be nice to have types for this and milliseconds v seconds. damn.

lol, yeah I ended up switching because the wastedPercent value is x/100, it's the wastedRatio thats x/1

new-audit: add unminified javascript audit

321df67

patrickhulce requested review from brendankenny and paulirish as code owners November 29, 2017 20:01

paulirish reviewed Dec 6, 2017

View reviewed changes

move out of WIP

fd547cd

patrickhulce changed the title ~~WIP: new-audit: add unminified javascript audit~~ new-audit: add unminified javascript audit Dec 7, 2017

patrickhulce changed the title ~~new-audit: add unminified javascript audit~~ new-audit(unminified-javascript): detect savings from minifying javascript Dec 7, 2017

Merge branch 'master' into js_minified

73e25ad

patrickhulce changed the title ~~new-audit(unminified-javascript): detect savings from minifying javascript~~ new-audit(unminified-javascript): detect savings from minifcation Dec 8, 2017

paulirish mentioned this pull request Dec 11, 2017

Audit: Resources are served minified #3459

Closed

patrickhulce added the waiting4reviewer label Dec 13, 2017

patrickhulce assigned paulirish Dec 13, 2017

paulirish requested changes Dec 18, 2017

View reviewed changes

feedback

25508c7

patrickhulce force-pushed the js_minified branch from 9c65ab0 to 25508c7 Compare December 18, 2017 21:40

paulirish approved these changes Dec 20, 2017

View reviewed changes

paulirish merged commit 017c9c1 into master Dec 20, 2017

paulirish deleted the js_minified branch December 20, 2017 02:31

paulirish removed the waiting4reviewer label Mar 6, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new-audit(unminified-javascript): detect savings from minifcation#3950

new-audit(unminified-javascript): detect savings from minifcation#3950
paulirish merged 4 commits intomasterfrom
js_minified

patrickhulce commented Nov 29, 2017 •

edited

Loading

Uh oh!

patrickhulce commented Nov 29, 2017 •

edited

Loading

Uh oh!

addyosmani commented Dec 6, 2017

Uh oh!

paulirish Dec 6, 2017

Uh oh!

patrickhulce commented Dec 8, 2017

Uh oh!

paulirish left a comment

Uh oh!

paulirish Dec 18, 2017

Uh oh!

patrickhulce Dec 18, 2017

Uh oh!

paulirish Dec 18, 2017

Uh oh!

patrickhulce Dec 18, 2017

Uh oh!

paulirish Dec 20, 2017

Uh oh!

patrickhulce Dec 20, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	let tokenLength = 0;
	let tokenLengthWithMangling = 0;

	const tokens = esprima.tokenize(scriptContent);
	for (const token of tokens) {
	tokenLength += token.value.length;
	// assume all identifiers could be reduced to a single character
	tokenLengthWithMangling += token.type === 'Identifier' ? 1 : token.value.length;
	}

+                  };
+                }
+                /**

Conversation

patrickhulce commented Nov 29, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

patrickhulce commented Nov 29, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

addyosmani commented Dec 6, 2017

Uh oh!

paulirish Dec 6, 2017

Choose a reason for hiding this comment

Uh oh!

patrickhulce commented Dec 8, 2017

Uh oh!

paulirish left a comment

Choose a reason for hiding this comment

Uh oh!

paulirish Dec 18, 2017

Choose a reason for hiding this comment

Uh oh!

patrickhulce Dec 18, 2017

Choose a reason for hiding this comment

Uh oh!

paulirish Dec 18, 2017

Choose a reason for hiding this comment

Uh oh!

patrickhulce Dec 18, 2017

Choose a reason for hiding this comment

Uh oh!

paulirish Dec 20, 2017

Choose a reason for hiding this comment

Uh oh!

patrickhulce Dec 20, 2017

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

patrickhulce commented Nov 29, 2017 •

edited

Loading

patrickhulce commented Nov 29, 2017 •

edited

Loading