Skip to content

Integrating OpenTelemetry code#390

Merged
ayusheesingh-zz merged 446 commits intoci_otfrom
t-aysi/open-telemetry
Jul 15, 2020
Merged

Integrating OpenTelemetry code#390
ayusheesingh-zz merged 446 commits intoci_otfrom
t-aysi/open-telemetry

Conversation

@ayusheesingh-zz
Copy link

No description provided.

rashmichandrashekar and others added 30 commits January 11, 2019 13:19
* Fix KubeAPI Calls to filter to get pods for current node

* Reinstate log line
* disable telemetry option

* fix a typo
Custom Metrics changes to ci_feature
…netes namespace, as it might be confused with metrics namespace in Metrics Explorer (#194)

* Zero Fill for Pod Counts by Phase

* Change namespace dimension to Kubernetes namespace
* Updatng release history

* fixing the plugin logs for emit stream

* updating log message

* Remove Log Processing from fluentd configuration

* Remove plugin references from base_container.data

* Dilipr/fluent bit log processing (#126)

* Build out_oms.so and include in docker-cimprov package

* Adding fluent-bit-config file to base container

* PR Feedback

* Adding out_oms.conf to base_container.data

* PR Feedback

* Making the critical section as small as possible

* PR Feedback

* Fixing the newline bug for Computer, and changing containerId to Id

* Dilipr/glide updates (#127)

* Updating glide.* files to include lumberjack

* containerID="" for pull issues

* Using KubeAPI for getting image,name. Adding more logs (#129)

* Using KubeAPI for getting image,name. Adding more logs

* Moving log file and state file to within the omsagent container

* Changing log and state paths

* Dilipr/mark comments (#130)

* Marks Comments + Error Handling

* Drop records from files that are not in k8s format

* Remove unnecessary log line'

* Adding Log to the file that doesn't conform to the expected format

* Rashmi/segfault latest (#132)

* adding null checks in all providers

* fixing type

* fixing type

* adding more null checks

* update cjson

* Adding a missed null check (#135)

* reusing some variables (#136)

* Rashmi/cjson delete null check (#138)

* adding null check for cjson-delete

* null chk

* removing null check

* updating log level to debug for some provider workflows (#139)

* Fixing CPU Utilization and removing Fluent-bit filters (#140)

Removing fluent-bit filters, CPU optimizations

* Minor tweaks 1. Remove some logging 2. Added more Error Handling 3. Continue when there is an error with k8s api (#141)

* Removing some logs, added more error checking, continue on kube-api error

* Return FLB OK for json Marshall error, instead of RETRY

* * Change FluentBit flush interval to 30 secs (from 5 secs)
* Remove ContainerPerf, ContainerServiceLog,ContainerProcess (OMI workflows) for Daemonset

* Container Log Telemetry

* Fixing an issue with Send Init Event if Telemetry is not initialized properly, tab to whitespace in conf file

* PR feedback

* PR feedback

* Sending an event every 5 mins(Heartbeat) (#146)

* PR feedback to cleanup removed workflows

* updating agent version for telemetry

* updating agent version

* Telemetry Updates (#149)

* Telemetry Fixes 1. Added Log Generation Rate 2. Fixed parsing bugs 3. Added code to send Exceptions/errors

* PR Feedback

* Changes to send omsagent/omsagent-rs kubectl logs to App Insights (#159)

* Changes to send omsagent/omsagent-rs kubectl logs to App Insights

* PR Feedback

* Rashmi/fluentd docker inventory (#160)

* first stab

* changes

* changes

* docker util changes

* working tested util

* input plugin and conf

* changes

* changes

* changes

* changes

* changes

* working containerinventory

* fixing omi removal from container.conf

* removing comments

* file write and read

* deleted containers working

* changes

* changes

* socket timeout

* deleting test files

* adding log

* fixing comment

* appinsights changes

* changes

* tel changes

* changes

* changes

* changes

* changes

* lib changes

* changes

* changes

* fixes

* PR comments

* changes

* updating the ownership

* changes

* changes

* changes to container data

* removing comment

* changes

* adding collection time

* bug fix

* env string truncation

* changes for acs-engine test

* Fix Telemetry Bug -- Initialize Telemetry Client after Initializing all required properties (#162)

* Fix kube events memory leak due to yaml serialization for > 5k events (#163)

* Setting Timeout for HTTP Client  in PostDataHelper in outoms go plugin(#164)

* Vishwa/perftelemetry 2 (#165)

* add cpu usage telemetry for ds & rs

* add cpu & memory usage telemetry for ds & rs

* environment variable fix (#166)

* environment variable fix

* updating agent version

* Fixing a bug where we were crashing due to container statuses not present when not was lost (#167)

* Updating title

* updating right versions for last release

* Updating the break condition to look for end of response (#168)

* Updating the break condition to look for end of response

* changes for docker response

* updating AgentVersion for telemetry

* Updating readme for latest release changes

* Changes - (#173)

* use /var/log for state
* new metric ContainerLogsAgentSideLatencyMs
* new field 'timeOfComand'

* Rashmi/kubenodeinventory (#174)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* Get cpuusage from usageseconds (#175)

* Rashmi/kubenodeinventory (#176)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* Rashmi/kubenodeinventory (#178)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* Fixing an issue on the cpurate metric, which happens for the first time (when cache is empty) (#179)

* Rashmi/kubenodeinventory (#180)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Exclude docker containers from container inventory (#181)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Excluding raw docker containers from container inventory

* making labels key case insensitive

* make poduid label case insensitive

* Exclude pauseamd64 containers from container inventory (#182)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Excluding raw docker containers from container inventory

* making labels key case insensitive

* make poduid label case insensitive

* changes to exclude pause amd 64 containers

* Update agent version

* Updating readme for the latest release

* Fix indentation in kube.conf and update readme (#184)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Excluding raw docker containers from container inventory

* making labels key case insensitive

* make poduid label case insensitive

* changes to exclude pause amd 64 containers

* fixing indentation so that kube.conf contents can be used in config map in the yaml

* updating readme to fix date and agent version

* updating agent tag

* Get Pods for current Node Only (#185)

* Fix KubeAPI Calls to filter to get pods for current node

* Reinstate log line

* changes for container node inventory fixed type (#186)

* Fix for mooncake (disable telemetry optionally) (#191)

* disable telemetry option

* fix a typo

* CustomMetrics to ci_feature (#193)

Custom Metrics changes to ci_feature

* add ContainerNotRunning column to KubePodInventory

* merge pr feedback: update name to ContainerStatusReason

* Zero Fill for Missing Pod Phases, Change Namespace Dimension to Kubernetes namespace, as it might be confused with metrics namespace in Metrics Explorer (#194)

* Zero Fill for Pod Counts by Phase

* Change namespace dimension to Kubernetes namespace

* No Retries for non 404 4xx errors (#196)

* Update agent version for telemetry

* Update readme for upcoming (ciprod01202019) release

* fix readme formatting

* fix formatting for readme

* fix formatting for readme

* fix readme

* fix readme

* fix agent version for telemetry
* Rotate logs every 10MB instead of weekly

* Removing some logging, fixed log rotation
* Updatng release history

* fixing the plugin logs for emit stream

* updating log message

* Remove Log Processing from fluentd configuration

* Remove plugin references from base_container.data

* Dilipr/fluent bit log processing (#126)

* Build out_oms.so and include in docker-cimprov package

* Adding fluent-bit-config file to base container

* PR Feedback

* Adding out_oms.conf to base_container.data

* PR Feedback

* Making the critical section as small as possible

* PR Feedback

* Fixing the newline bug for Computer, and changing containerId to Id

* Dilipr/glide updates (#127)

* Updating glide.* files to include lumberjack

* containerID="" for pull issues

* Using KubeAPI for getting image,name. Adding more logs (#129)

* Using KubeAPI for getting image,name. Adding more logs

* Moving log file and state file to within the omsagent container

* Changing log and state paths

* Dilipr/mark comments (#130)

* Marks Comments + Error Handling

* Drop records from files that are not in k8s format

* Remove unnecessary log line'

* Adding Log to the file that doesn't conform to the expected format

* Rashmi/segfault latest (#132)

* adding null checks in all providers

* fixing type

* fixing type

* adding more null checks

* update cjson

* Adding a missed null check (#135)

* reusing some variables (#136)

* Rashmi/cjson delete null check (#138)

* adding null check for cjson-delete

* null chk

* removing null check

* updating log level to debug for some provider workflows (#139)

* Fixing CPU Utilization and removing Fluent-bit filters (#140)

Removing fluent-bit filters, CPU optimizations

* Minor tweaks 1. Remove some logging 2. Added more Error Handling 3. Continue when there is an error with k8s api (#141)

* Removing some logs, added more error checking, continue on kube-api error

* Return FLB OK for json Marshall error, instead of RETRY

* * Change FluentBit flush interval to 30 secs (from 5 secs)
* Remove ContainerPerf, ContainerServiceLog,ContainerProcess (OMI workflows) for Daemonset

* Container Log Telemetry

* Fixing an issue with Send Init Event if Telemetry is not initialized properly, tab to whitespace in conf file

* PR feedback

* PR feedback

* Sending an event every 5 mins(Heartbeat) (#146)

* PR feedback to cleanup removed workflows

* updating agent version for telemetry

* updating agent version

* Telemetry Updates (#149)

* Telemetry Fixes 1. Added Log Generation Rate 2. Fixed parsing bugs 3. Added code to send Exceptions/errors

* PR Feedback

* Changes to send omsagent/omsagent-rs kubectl logs to App Insights (#159)

* Changes to send omsagent/omsagent-rs kubectl logs to App Insights

* PR Feedback

* Rashmi/fluentd docker inventory (#160)

* first stab

* changes

* changes

* docker util changes

* working tested util

* input plugin and conf

* changes

* changes

* changes

* changes

* changes

* working containerinventory

* fixing omi removal from container.conf

* removing comments

* file write and read

* deleted containers working

* changes

* changes

* socket timeout

* deleting test files

* adding log

* fixing comment

* appinsights changes

* changes

* tel changes

* changes

* changes

* changes

* changes

* lib changes

* changes

* changes

* fixes

* PR comments

* changes

* updating the ownership

* changes

* changes

* changes to container data

* removing comment

* changes

* adding collection time

* bug fix

* env string truncation

* changes for acs-engine test

* Fix Telemetry Bug -- Initialize Telemetry Client after Initializing all required properties (#162)

* Fix kube events memory leak due to yaml serialization for > 5k events (#163)

* Setting Timeout for HTTP Client  in PostDataHelper in outoms go plugin(#164)

* Vishwa/perftelemetry 2 (#165)

* add cpu usage telemetry for ds & rs

* add cpu & memory usage telemetry for ds & rs

* environment variable fix (#166)

* environment variable fix

* updating agent version

* Fixing a bug where we were crashing due to container statuses not present when not was lost (#167)

* Updating title

* updating right versions for last release

* Updating the break condition to look for end of response (#168)

* Updating the break condition to look for end of response

* changes for docker response

* updating AgentVersion for telemetry

* Updating readme for latest release changes

* Changes - (#173)

* use /var/log for state
* new metric ContainerLogsAgentSideLatencyMs
* new field 'timeOfComand'

* Rashmi/kubenodeinventory (#174)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* Get cpuusage from usageseconds (#175)

* Rashmi/kubenodeinventory (#176)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* Rashmi/kubenodeinventory (#178)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* Fixing an issue on the cpurate metric, which happens for the first time (when cache is empty) (#179)

* Rashmi/kubenodeinventory (#180)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Exclude docker containers from container inventory (#181)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Excluding raw docker containers from container inventory

* making labels key case insensitive

* make poduid label case insensitive

* Exclude pauseamd64 containers from container inventory (#182)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Excluding raw docker containers from container inventory

* making labels key case insensitive

* make poduid label case insensitive

* changes to exclude pause amd 64 containers

* Update agent version

* Updating readme for the latest release

* Fix indentation in kube.conf and update readme (#184)

* containernodeinventory changes

* changes for containernodeinventory

* changes to add node telemetry

* pod telemetry cahnges

* updated telemetry changes

* changes to get uid of owner references as controller id

* updating socket to the new mount location

* Adding exception telemetry and heartbeat

* changes to fix controller type

* Fixing typo

* fixing method signature

* updating plugins to get controller type from env

* fixing bugs

* changes to fixed type

* removing comments

* changes for fixed type

* adding kubelet version as a dimension

* Excluding raw docker containers from container inventory

* making labels key case insensitive

* make poduid label case insensitive

* changes to exclude pause amd 64 containers

* fixing indentation so that kube.conf contents can be used in config map in the yaml

* updating readme to fix date and agent version

* updating agent tag

* Get Pods for current Node Only (#185)

* Fix KubeAPI Calls to filter to get pods for current node

* Reinstate log line

* changes for container node inventory fixed type (#186)

* Fix for mooncake (disable telemetry optionally) (#191)

* disable telemetry option

* fix a typo

* CustomMetrics to ci_feature (#193)

Custom Metrics changes to ci_feature

* add ContainerNotRunning column to KubePodInventory

* merge pr feedback: update name to ContainerStatusReason

* Zero Fill for Missing Pod Phases, Change Namespace Dimension to Kubernetes namespace, as it might be confused with metrics namespace in Metrics Explorer (#194)

* Zero Fill for Pod Counts by Phase

* Change namespace dimension to Kubernetes namespace

* No Retries for non 404 4xx errors (#196)

* Update agent version for telemetry

* Update readme for upcoming (ciprod01202019) release

* fix readme formatting

* fix formatting for readme

* fix formatting for readme

* fix readme

* fix readme

* fix agent version for telemetry

* fix date in readme

* update readme

* Restart  logs every 10MB instead of weekly (#198)

* Rotate logs every 10MB instead of weekly

* Removing some logging, fixed log rotation

* update agent version for telemetry

* update readme
@ayusheesingh-zz ayusheesingh-zz merged commit dffcaff into ci_ot Jul 15, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants