postyear/DBRF

 
 

EcoForest(DBRF)

Eco-Forest is a more effective version of gcForest.

GOAL

To Do

  1. Analyze sklearn tree source code (DONE)
  2. Analyze sklearn ensemble source code (DONE)
  3. Add utils code: ForestUtils.py (DONE)
  4. Add utils code: EnhancedDTree.py (DONE)
  5. Add utils code: EnhancedForest.py (DONE)
  6. Case study: Forest Driver dataset test (DONE)
  7. LayerForest v0.1 - LayerDTree (DONE)
     • finish the layer structure of DTree
     • split the data by the Gini value of the leaf node
     • test the model on the global validation data
     • output the test data
  8. LayerForest v0.2 - LayerDTree (DONE)
     • driver dataset
     • EnhancedDTree.py
  9. LayerForest v0.3 - LayerDTree (DONE)
     • UCI Adult dataset
     • compare with xgb, rf, decisiontree
     • EnhancedDTree.py
  10. LayerForest v0.4 - LayerDTree++ (DONE)
     • eliminate the overfit result
     • k-fold
  11. LayerForest v0.5 - LayerForest (DONE)
     • EnhancedForest.py
     • bug: misconvergence
  12. LayerForest v0.6 - LayerForest (IN PROGRESS)
     • [D] debug: eliminate misconvergence [Dropout / Batch Normalization]
     • debug: eliminate overfit
     • debug: eliminate overly fast convergence [?]
     • outperform xgb, rf, decisiontree
     • all to do:
       • k-fold train [v]
       • avg predict [v]
       • threshold of all imp - avg [v]
       • dropout by score of est [v]
       • dropout by score of tree [x]
       • LR predict [v]
       • validation data split [v]
  13. LayerForest v0.7 (DONE)
     • Simplify Procedure
       • Data Load Utils
     • Multiclass Support
  14. LayerForest v0.8 (IN PROGRESS)
     • Simplify Procedure
       • Model Utils
       • DecomposerForest
       • AlgorithmUtils
  15. LayerForest v1.0 - ecoForest
     • Validation Data Split
     • MaxLayer Control
     • Train/Validation Loss Guide
     • Freq/Lift/Support Score
     • K-Fold
     • Smart Early Stop
     • LR Stacker
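
The v0.1 item on splitting the data by the Gini value of the leaf node can be illustrated with a minimal sketch. This is not the code in EnhancedDTree.py: the function names, the `threshold` parameter, and the rule "samples in impure leaves go on to the next layer" are assumptions made for illustration. In practice the leaf ids could come from sklearn's `DecisionTreeClassifier.apply`; here they are passed in as a plain list so the example stays dependency-free.

```python
from collections import Counter, defaultdict


def gini(labels):
    """Gini impurity 1 - sum(p_k^2) of a non-empty sequence of class labels."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def split_by_leaf_gini(leaf_ids, labels, threshold=0.2):
    """Partition sample indices by the impurity of the leaf they fall in.

    Returns (pure_idx, impure_idx): indices of samples in leaves whose Gini
    impurity is <= threshold, and indices of samples in all other leaves.
    The threshold value is a hypothetical default, not taken from the repo.
    """
    by_leaf = defaultdict(list)
    for i, leaf in enumerate(leaf_ids):
        by_leaf[leaf].append(i)

    pure_idx, impure_idx = [], []
    for idx in by_leaf.values():
        g = gini([labels[i] for i in idx])
        (pure_idx if g <= threshold else impure_idx).extend(idx)
    return sorted(pure_idx), sorted(impure_idx)


# Toy data: leaf 3 is pure (all class 1), leaf 5 is a 50/50 mix.
leaf_ids = [3, 3, 3, 5, 5, 5, 5]
labels = [1, 1, 1, 0, 1, 0, 1]
pure, impure = split_by_leaf_gini(leaf_ids, labels)
print(pure, impure)  # [0, 1, 2] [3, 4, 5, 6]
```

Under this assumed rule, the samples in `impure` would be the ones handed to the next layer for further training.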
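
The v1.0 items "MaxLayer Control", "Train/Validation Loss Guide" and "Smart Early Stop" suggest a layer-growing loop guided by validation loss. The sketch below shows one common way to do that, not necessarily how ecoForest will do it. The callbacks `train_layer` and `valid_loss` and the parameters `max_layers`, `patience` and `min_delta` are all hypothetical names introduced here.

```python
def grow_layers(train_layer, valid_loss, max_layers=10, patience=2, min_delta=1e-4):
    """Add layers until the validation loss stops improving.

    train_layer(k) -> fits and returns layer k           (hypothetical callback)
    valid_loss(layers) -> validation loss of the cascade (hypothetical callback)

    Stops after `patience` consecutive layers without an improvement of at
    least `min_delta`, or once `max_layers` layers have been trained, and
    returns the cascade truncated to its best depth plus the best loss.
    """
    layers = []
    best_loss, best_depth, stale = float("inf"), 0, 0
    for k in range(max_layers):
        layers.append(train_layer(k))
        loss = valid_loss(layers)
        if loss < best_loss - min_delta:
            best_loss, best_depth, stale = loss, len(layers), 0
        else:
            stale += 1
            if stale >= patience:
                break
    return layers[:best_depth], best_loss


# Toy run: the loss improves for three layers, then degrades twice.
losses = [0.60, 0.45, 0.40, 0.41, 0.42, 0.30]
layers, best = grow_layers(
    train_layer=lambda k: f"layer{k}",
    valid_loss=lambda ls: losses[len(ls) - 1],
    max_layers=6,
)
print(len(layers), best)  # 3 0.4
```

With `patience=2` the loop stops before reaching the sixth layer, even though that layer would have scored better. That trade-off is what a "smart" early-stop rule would need to tune.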

Other Files

EnhancedForest_multiclass_v0.2: the version from before AlgorithmUtils (12.10).

Using Data Introduction

Happy Hacking.

About

DBRF code and reviews

Languages

  • Jupyter Notebook 97.7%
  • Python 2.3%