trigger workflows to run for `pull_request` event by seanpreston · Pull Request #86 · ethyca/fidesops

seanpreston · 2021-11-18T11:29:56Z

Purpose

This PR adds the pull_request_target event to our workflows, which should allow PRs from forked repos to trigger workflow runs (i.e. CI via GitHub Actions).

See: https://docs.github.com/en/actions/learn-github-actions/events-that-trigger-workflows#pull_request_target for full docs.

This event runs in the base context of a pull request, so I believe this change has to be made in the repo itself, before any workflows on forked PRs will run.

As @iamkelllly has mentioned here: #26 (comment), there's also something we'll need to do with the GitHub tokens to stop them from being able to read secrets stored in the underlying repository. In lieu of completing that here, I've changed the repository settings to force us to approve any workflow runs for outside collaborators, which we can choose not to do until we're sure the token issue is handled.

This should yield an approval process like: https://docs.github.com/en/actions/managing-workflow-runs/approving-workflow-runs-from-public-forks

Ticket

Closes #26

NevilleS · 2021-11-18T15:40:32Z

I'd love a few others to weigh in here on the security side (paging @ThomasLaPiana and @PSalant726) just so we get this right. I like the idea of allowing forked PRs to run our actions for CI, but want to ensure we are very careful about how that can expose the GitHub secrets.

A couple articles I found on this:

On my reading I think I'd prefer:

- using pull_request (instead of pull_request_target) as the trigger, which should be able to build & run unit tests without any secrets... I think?
- only ever using the docker secrets on main, never on any pull_request or pull_request_target workflows
- having a separate workflow for the integration tests. This can be manual for now (i.e. a core member checks out and runs the code), but we could also make it so that adding a label like safe-to-test allows the integration tests to run; see the example from https://securitylab.github.com/research/github-actions-preventing-pwn-requests/:

    on:
      pull_request_target:
        types: [labeled]

    jobs:
      build:
        name: Build and test
        runs-on: ubuntu-latest
        if: contains(github.event.pull_request.labels.*.name, 'safe to test')
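
A minimal sketch of the first two preferences in the list above: unit tests run on pull_request without referencing any secrets, and the only job that reads Docker credentials is restricted to pushes on main. The `make pytest` command, the `DOCKER_USER` / `DOCKER_TOKEN` secret names, and the `@v4` action pin are assumptions made for illustration; none of them come from this thread.

```yaml
name: CI (sketch)

on:
  push:
    branches: [main]
  pull_request:

jobs:
  unit-tests:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      # Hypothetical test command; this job references no secrets.
      - run: make pytest

  docker-publish:
    # Runs only for pushes to main, never for pull_request events.
    if: github.event_name == 'push' && github.ref == 'refs/heads/main'
    needs: unit-tests
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Log in to the Docker registry
        env:
          DOCKER_USER: ${{ secrets.DOCKER_USER }}    # hypothetical secret name
          DOCKER_TOKEN: ${{ secrets.DOCKER_TOKEN }}  # hypothetical secret name
        run: echo "$DOCKER_TOKEN" | docker login --username "$DOCKER_USER" --password-stdin
```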

NevilleS · 2021-11-18T15:41:56Z

-on: [push]
+on: [
+  push,
+  pull_request_target


TL;DR from my comment (#86 (comment))

If you set this to pull_request instead of pull_request_target for now, we'll get 90% of the value of CI but I'd expect tests that rely on secrets to fail. That's a secure place to start from

seanpreston · 2021-11-18T17:03:20Z

> If you set this to pull_request instead of pull_request_target for now, we'll get 90% of the value of CI but I'd expect tests that rely on secrets to fail. That's a secure place to start from

Will update.

> having a separate workflow for the integration tests. this can be manual for now (i.e. a core member checks out and runs the code), but we could also make it so you add a label like safe-to-test and that allows integration tests to run; see the example from here https://securitylab.github.com/research/github-actions-preventing-pwn-requests/

I'd like to understand more about how the tokens are issued too. If it's the case that the tokens are short lived and can be set to only be generated on manual approval of a job run, it might not be a huge issue either way.

PSalant726

@NevilleS's suggestion about configuring the external integration tests to only run against PRs labeled safe-to-test (or similar) seems like a good potential route forward.

As another option, we could create an additional, similar automation that restricts the execution of the external integration tests only to PRs opened either 1) from a branch off of the repo itself (not a fork) or 2) by a member of the Ethyca GH org and/or the @ethycadev user group.

I don't see any way for us to safely automate the execution of any job that requires access to repository secrets when the PR being tested was opened by an untrusted user. For PRs opened by external contributors, I would recommend that an Ethyca employee first review the code changes here on GH, then, if everything seems safe, check out the fork's branch locally and execute the external integration tests manually. Since all PRs require an approval from a maintainer before merging anyway, this extra step doesn't seem like that much to ask.

As a last step, we should find a way to mark the external integration tests as "not run", and prevent the pipeline from displaying as passed, so we don't miss the previous manual step.
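
The fork-or-member restriction suggested above could be sketched as a job-level condition like the one below. This is an illustration under assumptions, not the configuration adopted in this PR: the `make pytest-integration-external` command and the `@v4` pin are hypothetical, and `author_association` reflects what GitHub reports in the event payload, which may not match membership of a specific team such as @ethycadev (checking team membership would need a separate API call).

```yaml
on:
  pull_request:

jobs:
  Integration-Tests-External:
    runs-on: ubuntu-latest
    # Run only when the PR branch lives in this repository (not a fork),
    # or when GitHub reports the PR author as an org member or owner.
    if: >-
      github.event.pull_request.head.repo.full_name == github.repository ||
      contains(fromJSON('["MEMBER", "OWNER"]'), github.event.pull_request.author_association)
    steps:
      - uses: actions/checkout@v4
      # Hypothetical command for the external integration tests.
      - run: make pytest-integration-external
```

When the condition is false the job is reported as skipped rather than passed, which is relevant to the "not run" concern in the last paragraph.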

PSalant726 · 2021-11-18T16:58:14Z

-on: [push]
+on: [
+  push,
+  pull_request_target


Building on this, I don't think it achieves what we want anyway. From the documentation linked by Sean:

> This event runs in the context of the base of the pull request, rather than in the merge commit as the pull_request event does.

I could be misunderstanding, but I interpret this as "run our workflows against the commit from which the PR was branched", not the latest commit on the PR itself. It's intended to be used as more of a housekeeping trigger, to add labels etc. to PRs. Isn't the intent to run CI actions against the changeset on the PR?

Separately, these changes, as written, don't seem to be using any of the additional permissions afforded by the pull_request_target trigger except to allow for execution of the external resource integration tests. Effectively, this change only serves to expose the ${{ secrets.REDSHIFT_TEST_URI }} and ${{ secrets.SNOWFLAKE_TEST_URI }} values to PRs opened by potentially untrusted users (repo forks).
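
One way to check the reading above is a throwaway workflow that prints the commits involved under each trigger. As far as GitHub's documentation describes it, `github.sha` is the merge commit for pull_request and the last commit on the PR's base branch for pull_request_target. The workflow name and job name below are made up for illustration; the job checks out no code and reads no secrets. Note that a pull_request_target workflow has to exist on the base branch before it will run.

```yaml
name: Show PR commit context (sketch)

on:
  pull_request:
  pull_request_target:

jobs:
  show-context:
    runs-on: ubuntu-latest
    steps:
      - name: Print the commits involved
        run: |
          echo "event:    ${{ github.event_name }}"
          echo "sha:      ${{ github.sha }}"
          echo "base sha: ${{ github.event.pull_request.base.sha }}"
          echo "head sha: ${{ github.event.pull_request.head.sha }}"
```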

NevilleS · 2021-11-18T17:56:56Z

From what I can see, the safe approach here for forks is to either do a manual checkout or use something that only we approve, i.e. labels or some more complex action like the `ok-to-test` action that is also linked in one of the GitHub security articles. But yeah, running our regular CI tests without secrets is OK with me, with the collaborator approval that is, just to also defend against spurious PRs costing us a lot of action minutes 😁

seanpreston · 2021-11-18T18:06:22Z

To be thorough I have done some testing this afternoon by opening up a PR against this repo from a new account. I can confirm that the new account is unable to add labels:

iamkelllly · 2021-11-18T20:03:06Z

Given that we're referencing pull_request rather than pull_request_target then, can we update the PR title @seanpreston ?

seanpreston · 2021-11-19T12:18:05Z

> As a last step, we should find a way to mark the external integration tests as "not run", and prevent the pipeline from displaying as passed, so we don't miss the previous manual step.

@PSalant726, it looks like this happens automatically:

UPDATE: and then once the label has been added:

We ought to stop every step from re-running each time a PR is labeled if we're also going to run them on push

seanpreston · 2021-11-19T14:30:56Z

I've updated this PR to factor in a couple of changes requested by both @NevilleS and @PSalant726:

- CI checks are now split into Safe and Unsafe categories
- Safe CI checks are run on every pull_request event. This allows forked PRs to run them (?), and is also triggered every time a commit is added to the PR (as before).
- Unsafe CI checks are triggered by the labeled type of pull_request events, and will only run if the label added contains the text run unsafe ci checks. NB. this category will still show a skipped label in the list of checks for push events, so the reviewer is aware that these checks have not run at the time of review.
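
The split described above might look roughly like the two workflow files below. This is a sketch, not the merged files: the file names, the `Unit-Tests` job name, the `make` commands, and the `@v4` pin are assumptions, while the `Integration-Tests-External` job name, its `if` condition, and the two secret names are taken from this thread.

```yaml
# .github/workflows/safe_pr_checks.yml (hypothetical file name)
name: Safe CI Checks
on:
  pull_request:   # default activity types: opened, synchronize, reopened
jobs:
  Unit-Tests:     # hypothetical job name
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: make pytest   # hypothetical command; no secrets referenced
```

```yaml
# .github/workflows/unsafe_pr_checks.yml (hypothetical file name)
name: Unsafe CI Checks
on:
  pull_request:
    types: [labeled]
jobs:
  Integration-Tests-External:
    runs-on: ubuntu-latest
    if: contains(github.event.pull_request.labels.*.name, 'run unsafe ci checks')
    steps:
      - uses: actions/checkout@v4
      - run: make pytest-integration-external   # hypothetical command
        env:
          # Per GitHub's docs, pull_request runs triggered from forks do not
          # receive repository secrets, so these would be empty for forked PRs.
          REDSHIFT_TEST_URI: ${{ secrets.REDSHIFT_TEST_URI }}
          SNOWFLAKE_TEST_URI: ${{ secrets.SNOWFLAKE_TEST_URI }}
```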

seanpreston · 2021-11-19T14:38:15Z

+jobs:
+  Integration-Tests-External:
+    runs-on: ubuntu-latest
+    if: contains(github.event.pull_request.labels.*.name, 'run unsafe ci checks')


This condition was a bit confusing to me on first pass, so to break down for anyone else:

github.event.pull_request.labels.*.name: this expression applies a filter to the github.event.pull_request.labels var, pulling out a list of label names, e.g. ["safe to run", "dont merge", ...].

contains(list, "value") then checks whether "value" is an element of list.

It seems simple when you put it that way, but if you don't know about the filter stage, it could look as though github.event.pull_request.labels.*.name evaluates to a string and we're checking whether that string contains the label name (which would be less robust).

https://docs.github.com/en/actions/learn-github-actions/expressions#object-filters
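
To make the two stages visible, a small debugging workflow along these lines could print the filtered list and the result of contains(). The workflow and job names are hypothetical. Values are passed through `env` rather than interpolated into the script, since label names are user-controlled text. Per the expressions documentation, contains() tests for an element when given an array and for a substring when given a string, and it is not case sensitive.

```yaml
name: Label filter demo (sketch)

on:
  pull_request:
    types: [labeled]

jobs:
  show-labels:
    runs-on: ubuntu-latest
    steps:
      - name: Print label names and the contains() result
        env:
          # The object filter yields an array of names; toJSON renders it as text.
          LABEL_NAMES: ${{ toJSON(github.event.pull_request.labels.*.name) }}
          # Evaluates to true when some label is named 'run unsafe ci checks'.
          HAS_LABEL: ${{ contains(github.event.pull_request.labels.*.name, 'run unsafe ci checks') }}
        run: |
          echo "label names: $LABEL_NAMES"
          echo "has label:   $HAS_LABEL"
```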

NevilleS

This looks good to me now.

However, with the number of actions we're running, I think we need to combine some of these jobs together to make the whole thing run faster, because we're definitely burning a ton of CI minutes on every single PR.

I'll open a new issue for that

seanpreston · 2021-11-19T15:35:14Z

> However, with the number of actions we're running, I think we need to combine some of these jobs together to make the whole thing run faster because we're definitely burning a ton of CI minutes on every single PR

The number of minutes burned might actually go down with this change, since we're switching from running the workflows on every push to only events related to pull requests.

PSalant726

@NevilleS -

> However, with the number of actions we're running, I think we need to combine some of these jobs together to make the whole thing run faster because we're definitely burning a ton of CI minutes on every single PR

Ideally the pipeline would:

- Build the container artifacts in a build stage, and make them available to the other stages that require them
- Execute static analysis tools and test harnesses in discrete pipeline lint and test stages, dependent on the build artifacts where needed (and, crucially, not dependent where not needed)

We have an issue in fidesctl to make a similar change.
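
The staged pipeline outlined above could be sketched with job-level `needs`, passing the built image between jobs as an artifact. Everything specific here is an assumption for illustration: the `fidesops:ci` image tag, the artifact name, the `make` commands, and the `@v4` action pins.

```yaml
name: Staged pipeline (sketch)

on:
  pull_request:

jobs:
  build:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: docker build -t fidesops:ci .        # hypothetical image tag
      - run: docker save fidesops:ci -o image.tar
      - uses: actions/upload-artifact@v4
        with:
          name: ci-image
          path: image.tar

  lint:
    # Static analysis does not need the image, so it has no `needs`
    # and starts in parallel with the build job.
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: make pylint                           # hypothetical command

  test:
    # Waits for the build job and reuses its image instead of rebuilding.
    needs: build
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/download-artifact@v4
        with:
          name: ci-image
      - run: docker load -i image.tar
      - run: make pytest                           # hypothetical command
```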

@seanpreston -
It's not enough to block this PR for me, but I think a cleaner implementation would keep things in a single file with a generic name like Fidesops. The jobs are still discrete and can have if conditions applied independently.

seanpreston · 2021-11-19T16:48:41Z

> It's not enough to block this PR for me, but I think a cleaner implementation would keep things in a single file with a generic name like Fidesops. The jobs are still discrete and can have if conditions applied independently.

I did have it like this initially. Because we're using both push and pull_request triggers, we'd end up needing custom if: ... conditionals on every job to be sure we're not running every job in both contexts. That was generating 15 job runs for every push, which seemed excessive. There might be another way to configure it that I didn't discover, but this felt clean enough, and has the added benefit of anything unsafe being explicit and in its own file.
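
For comparison, a single-file layout with per-job conditionals might look something like the sketch below. It is not a reconstruction of the earlier version of this PR: the `Unit-Tests` job name, the `make` commands, and the `@v4` pin are assumptions. It shows why each job needs its own guard once several triggers share one file.

```yaml
name: Fidesops

on:
  push:
  pull_request:
    types: [opened, synchronize, reopened, labeled]

jobs:
  Unit-Tests:   # hypothetical job name
    # Without this guard the job would run for push, for pull_request,
    # and again every time a label is added.
    if: github.event_name == 'pull_request' && github.event.action != 'labeled'
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: make pytest   # hypothetical command

  Integration-Tests-External:
    # Skipped (and shown as skipped) unless the PR carries the label.
    if: github.event_name == 'pull_request' && contains(github.event.pull_request.labels.*.name, 'run unsafe ci checks')
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: make pytest-integration-external   # hypothetical command
```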

* trigger workflows to run for pull_request_target event
* change to only pull_request
* check safe to test label
* what happens when a PR with the safe-to-test label is pushed to
* split pr_checks into safe + unsafe
* update names of jobs
* try removing push trigger
* add push trigger back so we can see the check as skipped, change label name to better reflect the action
* add pull_request back to safe checks
* remove push because pull_request synchronize should cover it
* remove arbitrary change

trigger workflows to run for pull_request_target event

82371f6

seanpreston assigned eastandwestwind Nov 18, 2021

seanpreston changed the title ~~trigger workflows to run for pull_request_target event~~ trigger workflows to run for pull_request_target event Nov 18, 2021

NevilleS suggested changes Nov 18, 2021

change to only pull_request

4daa46b

PSalant726 suggested changes Nov 18, 2021

seanpreston added the DON'T MERGE label Nov 18, 2021

seanpreston changed the title ~~trigger workflows to run for pull_request_target event~~ trigger workflows to run for pull_request event Nov 19, 2021

check safe to test label

1f7cd63

seanpreston added the safe to test label Nov 19, 2021

what happens when a PR with the safe-to-test label is pushed to

d2057f5

seanpreston removed the safe to test label Nov 19, 2021

split pr_checks into safe + unsafe

e007cdb

seanpreston added the safe to test label Nov 19, 2021

Sean Preston added 2 commits November 19, 2021 13:25

update names of jobs

785162c

try removing push trigger

478ea61

seanpreston added safe to test and removed safe to test labels Nov 19, 2021

add push trigger back so we can see the check as skipped, change label name to better reflect the action

912f1d4

seanpreston added the run unsafe ci checks label (Triggers running of unsafe CI checks) and removed the safe to test label Nov 19, 2021

Sean Preston added 2 commits November 19, 2021 13:51

add pull_request back to safe checks

910fc09

remove push because pull_request synchronize should cover it

3a97b35

seanpreston removed the run unsafe ci checks label (Triggers running of unsafe CI checks) Nov 19, 2021

seanpreston added the run unsafe ci checks label (Triggers running of unsafe CI checks) Nov 19, 2021

seanpreston added and removed the run unsafe ci checks label (Triggers running of unsafe CI checks) Nov 19, 2021

seanpreston commented Nov 19, 2021

NevilleS approved these changes Nov 19, 2021

Comment thread src/fidesops/main.py Outdated

remove arbitrary change

884239c

NevilleS mentioned this pull request Nov 19, 2021

Optimize CI actions to use fewer minutes (/w less parallelization or more caching) #90

Closed

PSalant726 approved these changes Nov 19, 2021

NevilleS merged commit 5c2caf1 into main Nov 19, 2021

NevilleS deleted the seanpreston-26-enable-ci branch November 19, 2021 16:38
