Seed pseudodata #1059

Closed
siranipour wants to merge 28 commits into master from seed-pseudodata

@siranipour
Contributor

Working on reconstructing the pseudodata of the nth n3fit replica.

@siranipour siranipour changed the base branch from master to python-pseudodata January 21, 2021 17:49
@siranipour siranipour marked this pull request as draft January 21, 2021 17:50
@siranipour
Contributor Author

siranipour commented Jan 21, 2021

@wilsonmr no need to take a look now, this is still very much at the prototype stage. One thing that may be difficult, though, is that:

def make_tr_val_mask(datasets, exp_name, seed):

still makes use of datasets. What are our plans for the future of this class? All the functionality we're implementing uses either CommonData or lists thereof.

Moreover, this function still uses the notion of experiment names, which has been deprecated since the data keyword was introduced.

@wilsonmr
Contributor

A few things:

  • I think this function really wants to be an action that takes as input the initialize_seeds action* (or something similar) and then calls make_replica. You might find this considerably easier after both MakeReplica in python #866 and Add python t0 covmat #1052 are merged, since the latter starts to add actions which use the new python machinery.
  • In the end we want to scrap most of reader.py. In this instance I would say that you want some reproducible way to generate training-validation splits. The splits themselves are easy to generate and can be copied from this function, but you may wish to change how the splits are seeded if you don't like using the experiment name. I wouldn't get completely hung up on this right now, but if you take data as input for now then you can extract the name from the data and use that in the hash to get a seed. Alternatively you could take group_name="data" as an argument, like produce_data, but having defaults set all over the place is something we want to avoid, so if we were to go down that route we should use the lockfile mechanism to set the default.

*Note: it's an n3fit action, so it would have to be moved in order to be accessed in vp.
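
To make the second point concrete, here is a minimal sketch of a reproducible training-validation split seeded from a hash of the dataset name. It is not the implementation in reader.py: the helper names seed_from_name and split_mask, the dataset names, the sizes and the training fraction are all hypothetical, and using hashlib for the hash and NumPy's Generator for the shuffle is an assumption about how one might do it.

```python
import hashlib

import numpy as np


def seed_from_name(name, base_seed):
    """Derive a reproducible 32-bit seed from a dataset name and a base seed.

    hashlib is used instead of the built-in hash(), whose value for strings
    can change from one interpreter session to the next.
    """
    digest = hashlib.sha256(f"{base_seed}:{name}".encode()).digest()
    return int.from_bytes(digest[:4], "little")


def split_mask(ndata, training_fraction, seed):
    """Return a boolean mask of length ndata, True for training points.

    Hypothetical helper: the number of training points is the integer part
    of ndata * training_fraction, and their positions are shuffled with an
    RNG that is fully determined by the seed.
    """
    rng = np.random.default_rng(seed)
    ntrain = int(ndata * training_fraction)
    mask = np.zeros(ndata, dtype=bool)
    mask[:ntrain] = True
    rng.shuffle(mask)
    return mask


# Hypothetical dataset names and numbers of points, for illustration only.
datasets = {"dataset_a": 10, "dataset_b": 7}
masks = {
    name: split_mask(ndata, 0.75, seed_from_name(name, base_seed=4))
    for name, ndata in datasets.items()
}
```

Because the seed depends only on the base seed and the name carried by the data, the same split can be regenerated later without any notion of an experiment name.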

siranipour and others added 22 commits January 27, 2021 15:13
No attempt at optimization has been done
Uses the same seed structure as C++ so will give
the same pseudodata if the same RNG seed is used.
This commit is a proof of concept.
This removes the for loop implementation
Also correcting some typos in the docstring of covmat_from_systematics
Co-authored-by: Cameron Voisey <32741139+voisey@users.noreply.github.com>
Thus it remembers the RNG state
Co-authored-by: Zaharid <zk261@cam.ac.uk>
Adding Zahari's review comments
Assigning seed randomly if not provided explicitly in function call
Co-authored-by: wilsonmr <33907451+wilsonmr@users.noreply.github.com>
Co-authored-by: Rosalyn Pearson <33020850+RosalynLP@users.noreply.github.com>
Base automatically changed from python-pseudodata to master February 26, 2021 11:50
@siranipour siranipour closed this Mar 10, 2021
@scarlehoff scarlehoff deleted the seed-pseudodata branch November 14, 2024 10:22