Rounding by jnelson16 · Pull Request #45 · QuantGov/quantgov

jnelson16 · 2018-04-12T16:02:11Z

No description provided.

complexity builtins

in setup.py

nltk fixes

nltk troubles

OliverSherouse

Small changes needed.

OliverSherouse · 2018-04-13T15:58:08Z

quantgov/__main__.py

        '--probability', action='store_true',
        help='output probabilities instead of predictions')
+    estimate.add_argument(
+        '--precision', default=4,


Set type to int here rather than cast in the function itself.

OliverSherouse · 2018-04-13T16:00:40Z

quantgov/estimator/estimation.py

    texts = (doc.text for doc in streamer)
-    yield from zip(streamer.index, pipeline.predict_proba(texts))
+    yield from zip(streamer.index, (i.round(int(precision))
+                   for i in pipeline.predict_proba(texts)))


Round at the array level, not the obvservation level. So:

yield from zip(streamer.index, pipeline.predict_proba(texts).round(precision))

OliverSherouse · 2018-04-13T16:01:18Z

quantgov/estimator/estimation.py

    predicted = pipeline.predict_proba(texts)
    for i, docidx in enumerate(streamer.index):
-        yield docidx, tuple(label_predictions[i]
+        yield docidx, tuple(label_predictions[i].round(int(precision))


jnelson16 · 2018-04-13T17:01:57Z

@OliverSherouse fixed those changes

OliverSherouse · 2018-04-13T17:17:32Z

quantgov/estimator/estimation.py

    texts = (doc.text for doc in streamer)
    truecol = list(int(i) for i in model.model.classes_).index(1)
-    predicted = ((round(i[truecol], int(precision))
+    predicted = ((round(i[truecol], precision)


Why aren't we selecting the whole column and rounding at once? I think that would be pipeline.predict_proba[:,truecol].round(precision).

* Inaugurated 0.4.0 dev series * Sentiment analysis (#33) Closes #11 #12 #13 and adds Sentiment analysis! * complexity * complexity builtins * complexity builtins with tests * code review updates * option tests * added nltk requirement in setup.py * add pip install to .travis.yml * nltk fixes * another nltk fix * last nltk fix? * you know the drill * Update .travis.yml * nltk troubles * some final cleanup * if it aint broke... * textblob sentiment * tests and error raising * fixed install req * pep8 fixes * code review updates * fix travis file * import fixes * small fix * Test corpora (#35) * complexity * complexity builtins * complexity builtins with tests * code review updates * option tests * added nltk requirement in setup.py * add pip install to .travis.yml * nltk fixes * another nltk fix * last nltk fix? * you know the drill * Update .travis.yml * nltk troubles * some final cleanup * new corpora in English!! * hotfix to add timestamp as corpus identifier * Skl compatibility (#41) * Add sklearn 0.17 compatibility Paper over library reorganization. * renamed corpora to corpus, added deprecation warning (#42) * renamed corpora to corpus, added deprecation warning * moved load_driver and set up for future forcing of full imports of submodules Closes #31 * S3 drivers (#44) * initial working commit for s3 driver and database driver * removing 3.6 formatting * adding extra requirements list * adding basic s3 driver test * Removing unnecessary function * This ain't 2007 * test updates * adding s3driver to new corpus structure * Rounding (#45) * bumped version

* Inaugurated 0.4.0 dev series * Sentiment analysis (#33) Closes #11 #12 #13 and adds Sentiment analysis! * complexity * complexity builtins * complexity builtins with tests * code review updates * option tests * added nltk requirement in setup.py * add pip install to .travis.yml * nltk fixes * another nltk fix * last nltk fix? * you know the drill * Update .travis.yml * nltk troubles * some final cleanup * if it aint broke... * textblob sentiment * tests and error raising * fixed install req * pep8 fixes * code review updates * fix travis file * import fixes * small fix * Test corpora (#35) * complexity * complexity builtins * complexity builtins with tests * code review updates * option tests * added nltk requirement in setup.py * add pip install to .travis.yml * nltk fixes * another nltk fix * last nltk fix? * you know the drill * Update .travis.yml * nltk troubles * some final cleanup * if it aint broke... * new corpora in English!! * hotfix to add timestamp as corpus identifier * Skl compatibility (#41) * Add sklearn 0.17 compatibility Paper over library reorganization. * renamed corpora to corpus, added deprecation warning (#42) * renamed corpora to corpus, added deprecation warning * moved load_driver and set up for future forcing of full imports of submodules Closes #31 * S3 drivers (#44) * initial working commit for s3 driver and database driver * removing 3.6 formatting * adding extra requirements list * adding basic s3 driver test * Removing unnecessary function * This ain't 2007 * test updates * adding s3driver to new corpus structure * Rounding (#45) * bumped version * Fix NLTK loading bug Fix evaluation order when NLTK is not present

* hotfix to add timestamp as corpus identifier (#39) * bumped version * Release 0.4 (#47) * Inaugurated 0.4.0 dev series * Sentiment analysis (#33) Closes #11 #12 #13 and adds Sentiment analysis! * complexity * complexity builtins * complexity builtins with tests * code review updates * option tests * added nltk requirement in setup.py * add pip install to .travis.yml * nltk fixes * another nltk fix * last nltk fix? * you know the drill * Update .travis.yml * nltk troubles * some final cleanup * if it aint broke... * textblob sentiment * tests and error raising * fixed install req * pep8 fixes * code review updates * fix travis file * import fixes * small fix * Test corpora (#35) * complexity * complexity builtins * complexity builtins with tests * code review updates * option tests * added nltk requirement in setup.py * add pip install to .travis.yml * nltk fixes * another nltk fix * last nltk fix? * you know the drill * Update .travis.yml * nltk troubles * some final cleanup * new corpora in English!! * hotfix to add timestamp as corpus identifier * Skl compatibility (#41) * Add sklearn 0.17 compatibility Paper over library reorganization. * renamed corpora to corpus, added deprecation warning (#42) * renamed corpora to corpus, added deprecation warning * moved load_driver and set up for future forcing of full imports of submodules Closes #31 * S3 drivers (#44) * initial working commit for s3 driver and database driver * removing 3.6 formatting * adding extra requirements list * adding basic s3 driver test * Removing unnecessary function * This ain't 2007 * test updates * adding s3driver to new corpus structure * Rounding (#45) * bumped version * Fix NLTK loading bug Fix evaluation order when NLTK is not present

Jonathan Nelson and others added 29 commits November 6, 2017 13:32

complexity

5028e17

complexity builtins

f540d17

complexity builtins with tests

6731383

Merge pull request #1 from jnelson16/complexity

7fc8d01

complexity builtins

code review updates

1aabb9a

option tests

43f4d37

added nltk requirement

d121398

in setup.py

add pip install to .travis.yml

a5c15c9

nltk fixes

c1ebb76

Merge pull request #2 from jnelson16/troubleshoot

1da893e

nltk fixes

another nltk fix

d731de1

last nltk fix?

1b0e35a

you know the drill

b1e142e

Update .travis.yml

41b17f8

nltk troubles

842204b

Merge pull request #3 from jnelson16/nltk-troubles

ead81fc

nltk troubles

some final cleanup

3f95a98

if it aint broke...

f9bd220

upstream merge

fe2249e

hotfix to add timestamp as corpus identifier (#39)

6b793a2

rounding probabilities

1a928ba

some estimator tests

c6af9e8

multiclass test and rounding arrays

bf3de83

upstream merge

ed172a4

Merge branch 'dev' of https://github.com/QuantGov/quantgov into rounding

49710e6

fixed tests to newer corpus

9564f6f

remove estimator .gitignore

7d270f8

pep8 fixes

be9fb29

indentation fix

9d35f1e

jnelson16 requested a review from OliverSherouse April 12, 2018 16:02

jnelson16 requested a review from mgasvoda April 12, 2018 16:02

mgasvoda approved these changes Apr 12, 2018

View reviewed changes

OliverSherouse suggested changes Apr 13, 2018

View reviewed changes

Oliver fixes

4aabc35

OliverSherouse reviewed Apr 13, 2018

View reviewed changes

Oliver fixes take 2

babfbca

OliverSherouse approved these changes Apr 13, 2018

View reviewed changes

OliverSherouse merged commit 6686e0d into dev Apr 13, 2018

OliverSherouse deleted the rounding branch April 13, 2018 17:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rounding#45

Rounding#45
OliverSherouse merged 31 commits intodevfrom
rounding

jnelson16 commented Apr 12, 2018

Uh oh!

OliverSherouse left a comment

Uh oh!

OliverSherouse Apr 13, 2018

Uh oh!

OliverSherouse Apr 13, 2018

Uh oh!

OliverSherouse Apr 13, 2018

Uh oh!

jnelson16 commented Apr 13, 2018

Uh oh!

OliverSherouse Apr 13, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jnelson16 commented Apr 12, 2018

Uh oh!

OliverSherouse left a comment

Choose a reason for hiding this comment

Uh oh!

OliverSherouse Apr 13, 2018

Choose a reason for hiding this comment

Uh oh!

OliverSherouse Apr 13, 2018

Choose a reason for hiding this comment

Uh oh!

OliverSherouse Apr 13, 2018

Choose a reason for hiding this comment

Uh oh!

jnelson16 commented Apr 13, 2018

Uh oh!

OliverSherouse Apr 13, 2018

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants