
Feature/pre train subreddit #15

Merged
hsson merged 28 commits into dev from feature/pre-train-subreddit
Apr 5, 2017

Conversation

@Jaxing
Contributor

@Jaxing Jaxing commented Apr 5, 2017

This enables pre-training the network on different output labels for the same input data, specifically using subreddits as labels for pre-training.
In main.py a new layer is added after the secondary output layer, so that the network can generalise from the learnt parameters; when tested, this showed better performance. Where in the network the secondary output should be placed should be treated as a hyperparameter.
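As a rough sketch of that hyperparameter (names are illustrative, not from this repo), the placement choice amounts to picking after which hidden layer the pre-training head is attached:

```python
# Hypothetical sketch: the position of the secondary (pre-training) output
# is a hyperparameter choosing after which hidden layer it is attached.
def layer_plan(num_hidden, secondary_after):
    plan = ["input"]
    for i in range(num_hidden):
        plan.append("hidden_%d" % i)
        if i == secondary_after:
            plan.append("secondary_output")  # subreddit pre-training head
    plan.append("output")  # the primary output layer stays last
    return plan
```

For example, `layer_plan(3, 0)` places the pre-training head right after the first hidden layer, while the primary output always comes last.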

Comment thread model/model.py
print("Starting training...")

if self.use_pretrained:
if self.use_pretrained and \
Contributor

This if statement can be simplified to:

if self.use_pretrained and (self.use_pretrained_net == use_pretrained_net):

Contributor

Also, what does this actually mean? This pull request doesn't change the behavior of the embedding matrix, right? So why is it required that self.use_pretrained_net == use_pretrained_net? And why are there two separate variables for use_pretrained_net?

Contributor Author

We only want to initialise the embedding matrix once, but we run train twice. Therefore we only run it if both self.use_pretrained_net and use_pretrained_net are true (i.e. we want to pre-train on subreddits and this call to train will pre-train), or if both are false (i.e. we don't want to pre-train and this call doesn't pre-train).
That's just bad naming: the class variable says whether the model should use pre-training at all, and the parameter says whether this particular call to the method should pre-train the network.
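A minimal sketch of that guard as a truth table (the function name is illustrative, not from the repo):

```python
def should_init_embeddings(use_pretrained, model_pretrains, call_pretrains):
    # The embedding matrix is initialised on exactly one of the two
    # train() calls: the one whose pre-train flag matches the model's
    # configuration, so the init never runs twice.
    return use_pretrained and (model_pretrains == call_pretrains)
```

With pre-training enabled, only the pre-training call initialises the embeddings; with it disabled, only the regular call does.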

Comment thread config.template.yaml Outdated
validation_data: 'validation_data_top_n_single.csv'
training_data: 'training_data_top_n_single.csv'
testing_data: 'testing_data_top_n_single.csv'
pre_train_data: 'pre-train_data_top_n.csv'
Contributor

Remove

Comment thread main.py Outdated
builder = ModelBuilder(config_file, sess)
builder.add_input_layer()

# Add a number of hidden layers
Contributor

Why is this done in main.py? Shouldn't it be done in the build function of the model builder?

Contributor Author

This is more modular: if someone wants to set up the network in a different way than the build method does, there would otherwise be no way of building the model.
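A toy sketch of that fluent-builder idea (class and method names are stand-ins, real layers replaced by strings): because main.py composes the layers itself, a caller can assemble a different topology without touching the builder:

```python
class NetworkSketch:
    """Toy stand-in for ModelBuilder; layers are modelled as strings."""
    def __init__(self):
        self.layers = []

    def add_input_layer(self):
        self.layers.append("input")
        return self  # returning self enables method chaining

    def add_hidden_layer(self, size):
        self.layers.append("hidden(%d)" % size)
        return self

    def add_secondary_output(self):
        self.layers.append("secondary_output")
        return self

    def add_output(self):
        self.layers.append("output")
        return self

    def build(self):
        return list(self.layers)
```

The caller decides the order, e.g. `NetworkSketch().add_input_layer().add_hidden_layer(128).add_secondary_output().add_hidden_layer(64).add_output().build()`, which is exactly the flexibility that would be lost if the layer composition were hard-coded inside build().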

Comment thread main.py
.add_precision_operations()

network_model = builder.build()
if config_file[USE_PRETRAINED_NET]:
Contributor

Can't this be moved to the model itself? To the beginning of the train function, maybe? Just trying to keep the main.py file as clean as possible 😄

Comment thread model/model.py
self._session.run(self.embedding_init,
feed_dict={self.embedding_placeholder:
self.data.embedding_matrix})
self.train_writer = \
Contributor

Why are the writers moved to the model builder?

Contributor Author

Perhaps not necessary, but otherwise the writer is initiated twice, once for each time we train.

Comment thread model/model_builder.py Outdated

return self

def add_secondary_output(self):
Contributor

This is very similar to the other output function. Is it possible to combine them and maybe take the labels as an input or something?

Contributor Author

Might be. One problem is that we don't want to overwrite the training op for either head; that could maybe be dealt with using a boolean.
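One way the combined version could look (a sketch with hypothetical names, not the repo's actual API): a shared helper that takes the labels as input and stores one training op per head, so adding the second head never overwrites the first's op:

```python
class OutputHeads:
    # Sketch: both output layers share one implementation; training ops
    # are keyed per head, so neither head overwrites the other's op.
    def __init__(self):
        self.train_ops = {}

    def _add_output(self, head, labels):
        # Placeholder for building logits + loss + optimizer over `labels`.
        self.train_ops[head] = "train_op(%s, %d labels)" % (head, len(labels))
        return self

    def add_output(self, labels):
        return self._add_output("primary", labels)

    def add_secondary_output(self, labels):
        return self._add_output("secondary", labels)
```

Keying by head name side-steps the boolean flag entirely, at the cost of the dict lookup when selecting which op to run.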

Comment thread model/util/csv_reader.py Outdated
TRAINING = "training_data"
VALIDATION = "validation_data"

PRE_TRAINING = "pre_training_data"
Contributor

remove

Comment thread model/model.py
self.constant_prediction_limit = config[CONSTANT_PREDICTION_LIMIT]
self.use_concat_input = config[USE_CONCAT_INPUT]
self.use_pretrained_net = config[USE_PRETRAINED_NET]
self.subreddit_count = 0
Contributor

Is this variable used?

Contributor Author

yes

@hsson
Contributor

hsson commented Apr 5, 2017

Also, I just realised that the concatenated subreddit input is still used when pre-training on subreddits. Maybe this should not be concatenated when doing that? Seems a bit biased. At the same time... the dimensions of the weights depend on the size of the input.

@hsson
Contributor

hsson commented Apr 5, 2017

  • Use softmax for subreddit predictions.
  • Output layers should be last

@hsson
Contributor

hsson commented Apr 5, 2017

Feedback from meeting has been implemented and tested.

@hsson hsson merged commit 8158619 into dev Apr 5, 2017
@hsson hsson deleted the feature/pre-train-subreddit branch April 5, 2017 17:49