@adoroszlai
Contributor

What changes were proposed in this pull request?

The kubernetes check is failing in CI for ozone without any code change. The same test passes locally on another Kubernetes distribution. The failure is probably caused by an environmental change related to persistent volumes. I propose disabling the failing test temporarily until it can be fixed.

Last successful run: https://github.com/apache/ozone/runs/3137129344
First failed run: https://github.com/apache/ozone/runs/3140001398

https://issues.apache.org/jira/browse/HDDS-5492

How was this patch tested?

https://github.com/adoroszlai/hadoop-ozone/runs/3158417215

@adoroszlai adoroszlai self-assigned this Jul 26, 2021
@elek
Member

elek commented Jul 26, 2021

🤔

Let me check whether it can be fixed easily instead of disabling it.

Member

@elek elek left a comment

I checked and it passes locally. I also found that we still have the other k8s tests, which provide basic smoketest coverage.

I am convinced that it's not a regression in the code itself, so I am fine with turning it off until it's fixed.

@adoroszlai
Contributor Author

Thanks @elek for the review.

@adoroszlai adoroszlai merged commit 12958d3 into apache:master Jul 26, 2021
@GeorgeJahad
Contributor

I think the problem is a change in the k3s tool (which explains why you probably weren't seeing it locally, if you are running an old version).

I have a fix here:
#2464

@adoroszlai adoroszlai deleted the HDDS-5492 branch July 27, 2021 06:01
errose28 added a commit to errose28/ozone that referenced this pull request Jul 30, 2021
* master: (48 commits)
  HDDS-5514. Skip check for UNHEALTHY containers for datanode finalize. (apache#2469)
  HDDS-5279. OFS mkdir -p does not work when Volume is not pre-created (apache#2412)
  HDDS-5328. Remove delete container command from admin CLI (apache#2456)
  HDDS-5382. Increase default container report interval to 60 mins (apache#2363)
  HDDS-5378 Add APIs to retrieve Namespace Summary from Recon (apache#2417)
  HDDS-5466. Refactor BlockOutputStream. (apache#2442)
  HDDS-5465. Delete redundant code when set、add and remove bucket acl (apache#2439)
  HDDS-5184. Use separate DB profile for Datanodes. (apache#2214)
  HDDS-5494. Reduce retry in Kubernetes test (apache#2461)
  HDDS-5414. Data buffers incorrectly filtered for Ozone Insight (apache#2387)
  HDDS-5450. Avoid refresh pipeline for S3 headObject (apache#2431)
  HDDS-5500. New k3s version breaks kubernetes test (apache#2464)
  HDDS-5489. Install OS-specific flekszible (apache#2462)
  Multi-raft style placement with permutations for offline data generator (apache#2434)
  HDDS-5484. Intermittent failure in TestReplicationManager#testMovePrerequisites (apache#2454)
  HDDS-5443 Create and then recreate a bucket with a randomized name (apache#2436)
  HDDS-5492. Disable failing kubernetes test (apache#2459)
  HDDS-4330. Bootstrap new OM node (apache#1494)
  HDDS-5418. Let Recon send reregisterCommand to Datanodes if DatanodeDetails changed (apache#2392)
  HDDS-5479. s3g bucket list failed when there is non-english key name. (apache#2450)
  ...
3 participants