Fox MPC Policy#13

Open

tiavlovsky wants to merge 20 commits intomasterfrom

Contributor

tiavlovsky commented Apr 28, 2021

Prototype of the reactive effective state pressure MPC policy


          preparing the fox mpc policy

686b6b7

tiavlovsky requested review from alexisboukouvalas and sergiovalmac

April 28, 2021 17:16

tiavlovsky added 4 commits

April 28, 2021 18:23


          shortcut fluid policy

6b55fd3


          working version

69c20f4


          augmented version

cc9714d


          back to simple version

07b6878

sergiovalmac reviewed

View reviewed changes

Contributor

sergiovalmac left a comment

Very exciting getting the new policy in place!
We have discussed most of my major comments during the peer-review session. I have added other relevant comments like summarising snippets of code into functions and renaming some variable to enhance readability. Plus some minor comments, mainly missing/outdated docstrings, missing commas, etc.

src/snc/agents/hedgehog/hh_agents/hedgehog_agent_interface.py

-                      load_ph, sigma_2_ph = load_sig
-                      return load_ph, sigma_2_ph
+                      load_ph, sigma_2_ph, w_dirs_to_resources = load_sig
+                      return load_ph, sigma_2_ph, w_dirs_to_resources

Contributor

sergiovalmac May 3, 2021

Please correct the return type annotation, and extend the docstring to account for the new variable in the description and to include :return field.

src/snc/agents/hedgehog/hh_agents/hedgehog_agent_interface.py Outdated

                       """
-                      strategic_idling_tuple = self.strategic_idling_object.get_allowed_idling_directions(state)
+                      strategic_idling_tuple = self.strategic_idling_object.get_allowed_idling_directions(state,
+                                                                                                          safety_stocks_vec)

Contributor

sergiovalmac May 3, 2021

Is passing safety_stocks_vec here general and valid for any strategic idling class? or is it specific of the foresight variant and will break if we use any other class?

Contributor Author

tiavlovsky May 16, 2021

Addressed!

src/snc/agents/hedgehog/hh_agents/hedgehog_agent_interface.py Outdated

Comment on lines +392 to +393

		#kwargs_get_policy = self.serialise_get_policy_kwargs(**kwargs)
		#z_star, _ = self.policy_obj.get_policy(**kwargs_get_policy)

Contributor

sergiovalmac May 3, 2021

Aren't we breaking the code here?
Does this work with any other class different from FoxMpc?

Contributor Author

tiavlovsky May 16, 2021

Reverted!

src/snc/agents/hedgehog/hh_agents/hedgehog_agent_interface.py Outdated

                       # If any resource is starving or countdown ends, then recompute activity rates.
-                      if self.num_steps_to_recompute_policy == 0:
+                      if True:#self.num_steps_to_recompute_policy == 0:

Contributor

sergiovalmac May 3, 2021

Is this a hack for bypassing a json config file?

Contributor Author

tiavlovsky May 16, 2021

Removed!

src/snc/agents/hedgehog/hh_agents/hedgehog_agent_interface.py Outdated

Comment on lines +458 to +462

    
                          #self.num_steps_to_recompute_policy = self.get_num_steps_to_recompute_policy(

                              #current_horizon,

                              #self.hedgehog_hyperparams.horizon_mpc_ratio,

                              #self.hedgehog_hyperparams.minimum_horizon

                          #)

Contributor

sergiovalmac May 3, 2021

Why bypassing all this? Will this work beyond FoxMpc?

Contributor Author

tiavlovsky May 16, 2021

Reverted!

src/snc/agents/steady_state_agents/steady_state_policy_agent.py

Comment on lines +84 to +88

+                          state=state,
+                          x_star = state,
+                          x_eff = state,
+                          r_idling_set = np.array([]),
+                          draining_resources = set(),

Contributor

sergiovalmac May 10, 2021

Is this really needed or a hack to use a different strategic idling class by default?

Contributor Author

tiavlovsky May 17, 2021

simulation to estimate asymptotic covariance uses the same mpc policy as the real agent but passes a different fluid policy. In case of Fox instead of fluid policy, effective state should be passed to mpc policy. how can we work around it here?

src/snc/agents/hedgehog/strategic_idling/strategic_idling_hedgehog_gto.py Outdated

                       :return: set of allowed idling resources with auxiliary variables
                       """
                       w = self._workload_mat @ state
+                      self._safety_stocks_vec = safety_stocks_vec

Contributor

sergiovalmac May 10, 2021

This seems out of nowhere here and difficult to know when it will be used. Can we do anything about that?

src/snc/agents/hedgehog/strategic_idling/strategic_idling_foresight.py Outdated

                       self._compute_num_roll_out_steps()
-                  def get_allowed_idling_directions(self, state: StateSpace) -> StrategicIdlingOutput:
+                  def get_allowed_idling_directions(self, state: StateSpace, safety_stocks_vec) -> StrategicIdlingOutput:

Contributor

sergiovalmac May 10, 2021

This seems out of nowhere here and difficult to know when it will be used. Can we do anything about that? Happy to discuss...

src/snc/agents/hedgehog/strategic_idling/strategic_idling.py Outdated

                           self._cost_per_buffer.T @ x_var
                           + penalty_coeff_w_star * cvx.sum(self._workload_mat @ x_var - w_par))
                       constraints = [self._workload_mat @ x_var >= w_par]
+                      a_mat = np.vstack(self.list_boundary_constraint_matrices)

Contributor

sergiovalmac May 10, 2021

Could we rename a_mat more descriptively, maybe something like resource_to_buffer_mat?

src/snc/agents/hedgehog/strategic_idling/strategic_idling.py Outdated

                   def _find_workload_with_min_eff_cost_by_idling(self, w: WorkloadSpace) -> WorkloadSpace:
                       self._w_param.value = w
+                      self._safety_stocks_param.value = np.zeros_like(self._safety_stocks_vec)

Contributor

sergiovalmac May 10, 2021

Do we need this initialisation if we set it to the right value below in Line 269?

tiavlovsky and others added 15 commits

May 16, 2021 15:19


          revert hedgehog agent interface

a9ae2cf


          new agent class

397c047


          revert agent interface

89ac6a1


          revert big step agent

6c31bca


          remove file from PR

69a1c25


          revert

a1ba33c


          revert

43f69f0


          revert 2

ba59359


          clear core strategic idling

7f5ed00


          cleaning up

fc7f8db


          Update src/snc/agents/hedgehog/safety_stocks.py

af40e88

Co-authored-by: sergiovalmac <serteckian@gmail.com>


          clean strategic idling hedging

1fc7db3


          wofijwa;oefij

ce02b2a


          fox ready to be run

5ad6d4b


          Update src/snc/agents/activity_rate_to_mpc_actions/fox_mpc.py

7f7f615

Co-authored-by: sergiovalmac <serteckian@gmail.com>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet