Skip to content
Merged

far #53

Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
72688ad
[doc] add booster docstring and fix autodoc (#3789)
ver217 May 22, 2023
d9393b8
[doc] add deprecated warning on doc Basics section (#3754)
Yanjia0 May 22, 2023
fe1561a
[doc] update gradient cliping document (#3778)
flybird11111 May 22, 2023
62c7e67
[format] applied code formatting on changed files in pull request 378…
github-actions[bot] May 22, 2023
4d29c0f
Fix/docker action (#3266)
liuzeming-yuxi May 22, 2023
788e07d
[workflow] fixed the docker build workflow (#3794)
FrankLeeeee May 22, 2023
f5c425c
fixed the example docstring for booster (#3795)
FrankLeeeee May 22, 2023
ef02d7e
[doc] update gradient accumulation (#3771)
flybird11111 May 23, 2023
ad93c73
[workflow] enable testing for develop & feature branch (#3801)
FrankLeeeee May 23, 2023
615e2e5
[test] fixed lazy init test import error (#3799)
FrankLeeeee May 23, 2023
e871e34
[API] add docstrings and initialization to apex amp, naive amp (#3783)
flybird11111 May 23, 2023
9265f2d
[NFC]fix typo colossalai/auto_parallel nn utils etc. (#3779)
digger-yu May 23, 2023
8c62e50
[doc] update amp document
flybird11111 May 23, 2023
1167bf5
[doc] update amp document
flybird11111 May 23, 2023
a520610
[doc] update amp document
flybird11111 May 23, 2023
75272ef
[doc] add removed warning
flybird11111 May 23, 2023
c425a69
[doc] add removed change of config.py
flybird11111 May 23, 2023
6b305a9
[booster] torch fsdp fix ckpt (#3788)
wukong1992 May 23, 2023
19d1530
[doc] add warning about fsdp plugin (#3813)
ver217 May 23, 2023
1e3b64f
[workflow] enblaed doc build from a forked repo (#3815)
FrankLeeeee May 23, 2023
8aa1fb2
[doc]fix
flybird11111 May 23, 2023
278fcbc
[doc]fix
flybird11111 May 23, 2023
725365f
Merge pull request #3810 from jiangmingyan/amp
flybird11111 May 23, 2023
7f8203a
fix typo colossalai/auto_parallel autochunk fx/passes etc. (#3808)
digger-yu May 24, 2023
269150b
[Docker] Fix a couple of build issues (#3691)
ymwangg May 24, 2023
05b8a8d
[workflow] changed to doc build to be on schedule and release (#3825)
FrankLeeeee May 24, 2023
3496637
[evaluation] add automatic evaluation pipeline (#3821)
chengeharrison May 24, 2023
e90fdb1
fix typo docs/
digger-yu May 24, 2023
518b31c
[docs] change placememt_policy to placement_policy (#3829)
digger-yu May 24, 2023
84500b7
[workflow] fixed testmon cache in build CI (#3806)
FrankLeeeee May 24, 2023
3229f93
[booster] add warning for torch fsdp plugin doc (#3833)
wukong1992 May 25, 2023
54e97ed
[workflow] supported test on CUDA 10.2 (#3841)
FrankLeeeee May 25, 2023
a64df3f
[doc] update document of gemini instruction. (#3842)
flybird11111 May 25, 2023
e2d81eb
[nfc] fix typo colossalai/ applications/ (#3831)
digger-yu May 25, 2023
d42b1be
[release] bump to v0.3.0 (#3830)
FrankLeeeee May 25, 2023
ae959a7
[workflow] fixed workflow check for docker build (#3849)
FrankLeeeee May 25, 2023
b047487
[doc] update nvme offload documents. (#3850)
flybird11111 May 25, 2023
2506e27
[evaluation] improvement on evaluation (#3862)
chengeharrison May 30, 2023
5f79008
[example] update gemini examples (#3868)
flybird11111 May 30, 2023
281b33f
[doc] update document of zero with chunk. (#3855)
flybird11111 May 30, 2023
70c8cde
[nfc] fix typo colossalai/cli fx kernel (#3847)
digger-yu Jun 2, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions .github/workflows/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@
- [Compatibility Test on Dispatch](#compatibility-test-on-dispatch)
- [Release](#release)
- [User Friendliness](#user-friendliness)
- [Commmunity](#commmunity)
- [Community](#community)
- [Configuration](#configuration)
- [Progress Log](#progress-log)

Expand Down Expand Up @@ -97,7 +97,7 @@ This workflow is triggered by manually dispatching the workflow. It has the foll
| `Synchronize submodule` | `submodule.yml` | This workflow will check if any git submodule is updated. If so, it will create a PR to update the submodule pointers. |
| `Close inactive issues` | `close_inactive.yml` | This workflow will close issues which are stale for 14 days. |

### Commmunity
### Community

| Workflow Name | File name | Description |
| -------------------------------------------- | -------------------------------- | -------------------------------------------------------------------------------- |
Expand Down
29 changes: 16 additions & 13 deletions .github/workflows/build_on_pr.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,24 +3,27 @@ name: Build on PR
on:
pull_request:
types: [synchronize, opened, reopened]
branches:
- "main"
- "develop"
- "feature/**"
paths:
- '.github/workflows/build_on_pr.yml' # run command & env variables change
- 'colossalai/**' # source code change
- '!colossalai/**.md' # ignore doc change
- 'op_builder/**' # cuda extension change
- '!op_builder/**.md' # ignore doc change
- 'requirements/**' # requirements change
- 'tests/**' # test change
- '!tests/**.md' # ignore doc change
- 'pytest.ini' # test config change
- 'setup.py' # install command change
- ".github/workflows/build_on_pr.yml" # run command & env variables change
- "colossalai/**" # source code change
- "!colossalai/**.md" # ignore doc change
- "op_builder/**" # cuda extension change
- "!op_builder/**.md" # ignore doc change
- "requirements/**" # requirements change
- "tests/**" # test change
- "!tests/**.md" # ignore doc change
- "pytest.ini" # test config change
- "setup.py" # install command change

jobs:
detect:
name: Detect file change
if: |
github.event.pull_request.draft == false &&
github.base_ref == 'main' &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI'
outputs:
changedExtenisonFiles: ${{ steps.find-extension-change.outputs.all_changed_files }}
Expand Down Expand Up @@ -133,7 +136,7 @@ jobs:
- name: Restore Testmon Cache
run: |
if [ -d /github/home/testmon_cache ]; then
[ ! -z "$(ls -A /github/home/testmon_cache)" ] && cp -p -r /github/home/testmon_cache/.testmondata /__w/ColossalAI/ColossalAI/
[ ! -z "$(ls -A /github/home/testmon_cache)" ] && cp -p -r /github/home/testmon_cache/.testmondata* /__w/ColossalAI/ColossalAI/
fi

- name: Execute Unit Testing
Expand All @@ -147,7 +150,7 @@ jobs:
- name: Store Testmon Cache
run: |
[ -d /github/home/testmon_cache ] || mkdir /github/home/testmon_cache
cp -p -r /__w/ColossalAI/ColossalAI/.testmondata /github/home/testmon_cache/
cp -p -r /__w/ColossalAI/ColossalAI/.testmondata* /github/home/testmon_cache/

- name: Collate artifact
env:
Expand Down
47 changes: 29 additions & 18 deletions .github/workflows/compatiblity_test_on_dispatch.yml
Original file line number Diff line number Diff line change
Expand Up @@ -19,26 +19,26 @@ jobs:
outputs:
matrix: ${{ steps.set-matrix.outputs.matrix }}
steps:
- id: set-matrix
env:
TORCH_VERSIONS: ${{ inputs.torch_version }}
CUDA_VERSIONS: ${{ inputs.cuda_version }}
run: |
IFS=','
DOCKER_IMAGE=()
- id: set-matrix
env:
TORCH_VERSIONS: ${{ inputs.torch_version }}
CUDA_VERSIONS: ${{ inputs.cuda_version }}
run: |
IFS=','
DOCKER_IMAGE=()

for tv in $TORCH_VERSIONS
do
for cv in $CUDA_VERSIONS
do
DOCKER_IMAGE+=("\"hpcaitech/pytorch-cuda:${tv}-${cv}\"")
done
done
for tv in $TORCH_VERSIONS
do
for cv in $CUDA_VERSIONS
do
DOCKER_IMAGE+=("\"hpcaitech/pytorch-cuda:${tv}-${cv}\"")
done
done

container=$( IFS=',' ; echo "${DOCKER_IMAGE[*]}" )
container="[${container}]"
echo "$container"
echo "::set-output name=matrix::{\"container\":$(echo "$container")}"
container=$( IFS=',' ; echo "${DOCKER_IMAGE[*]}" )
container="[${container}]"
echo "$container"
echo "::set-output name=matrix::{\"container\":$(echo "$container")}"

build:
name: Test for PyTorch Compatibility
Expand Down Expand Up @@ -70,6 +70,17 @@ jobs:
- uses: actions/checkout@v2
with:
ssh-key: ${{ secrets.SSH_KEY_FOR_CI }}
- name: Download cub for CUDA 10.2
run: |
CUDA_VERSION=$(cat $CUDA_HOME/version.txt | grep "CUDA Version" | awk '{print $NF}' | cut -d. -f1,2)

# check if it is CUDA 10.2
# download cub
if [ "$CUDA_VERSION" = "10.2" ]; then
wget https://github.com/NVIDIA/cub/archive/refs/tags/1.8.0.zip
unzip 1.8.0.zip
cp -r cub-1.8.0/cub/ colossalai/kernel/cuda_native/csrc/kernels/include/
fi
- name: Install Colossal-AI
run: |
pip install -r requirements/requirements.txt
Expand Down
16 changes: 14 additions & 2 deletions .github/workflows/compatiblity_test_on_pr.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,8 @@ name: Compatibility Test on PR
on:
pull_request:
paths:
- 'version.txt'
- '.compatibility'
- "version.txt"
- ".compatibility"

jobs:
matrix_preparation:
Expand Down Expand Up @@ -58,6 +58,18 @@ jobs:
- uses: actions/checkout@v2
with:
ssh-key: ${{ secrets.SSH_KEY_FOR_CI }}
- name: Download cub for CUDA 10.2
run: |
CUDA_VERSION=$(cat $CUDA_HOME/version.txt | grep "CUDA Version" | awk '{print $NF}' | cut -d. -f1,2)

# check if it is CUDA 10.2
# download cub
if [ "$CUDA_VERSION" = "10.2" ]; then
wget https://github.com/NVIDIA/cub/archive/refs/tags/1.8.0.zip
unzip 1.8.0.zip
cp -r cub-1.8.0/cub/ colossalai/kernel/cuda_native/csrc/kernels/include/
fi

- name: Install Colossal-AI
run: |
pip install -v --no-cache-dir .
Expand Down
Original file line number Diff line number Diff line change
@@ -1,18 +1,16 @@
name: Build Documentation After Merge
name: Build Documentation On Schedule & After Release

on:
workflow_dispatch:
pull_request:
paths:
- 'version.txt'
- 'docs/**'
types:
- closed
schedule:
- cron: "0 12 * * *" # build doc every day at 8pm Singapore time (12pm UTC time)
release:
types: [published]

jobs:
build-doc:
name: Trigger Documentation Build Workflow
if: ( github.event_name == 'workflow_dispatch' || github.event.pull_request.merged == true ) && github.repository == 'hpcaitech/ColossalAI'
if: github.repository == 'hpcaitech/ColossalAI'
runs-on: ubuntu-latest
steps:
- name: trigger workflow in ColossalAI-Documentation
Expand Down
27 changes: 14 additions & 13 deletions .github/workflows/doc_check_on_pr.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,47 +2,49 @@ name: Check Documentation on PR

on:
pull_request:
branches:
- "main"
- "develop"
- "feature/**"
paths:
- 'docs/**'
- "docs/**"

jobs:
check-i18n:
name: Check docs in diff languages
if: |
github.event.pull_request.draft == false &&
github.base_ref == 'main' &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI'
github.event.pull_request.draft == false &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI'
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v2

- uses: actions/setup-python@v2
with:
python-version: '3.8.14'
python-version: "3.8.14"

- run: python .github/workflows/scripts/check_doc_i18n.py -d docs/source

check-doc-build:
name: Test if the docs can be built
if: |
github.event.pull_request.draft == false &&
github.base_ref == 'main' &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI'
github.event.pull_request.draft == false &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI'
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v2
with:
path: './ColossalAI'
path: "./ColossalAI"
fetch-depth: 0

- uses: actions/checkout@v2
with:
path: './ColossalAI-Documentation'
repository: 'hpcaitech/ColossalAI-Documentation'
path: "./ColossalAI-Documentation"
repository: "hpcaitech/ColossalAI-Documentation"

- uses: actions/setup-python@v2
with:
python-version: '3.8.14'
python-version: "3.8.14"

# we use the versions in the main branch as the guide for versions to display
# checkout will give your merged branch
Expand All @@ -57,7 +59,6 @@ jobs:
git config user.name 'github-actions'
git config user.email 'github-actions@github.com'


- name: Build docs
run: |
cache_dir=ColossalAI-Documentation/doc-build/.cache
Expand Down
30 changes: 16 additions & 14 deletions .github/workflows/doc_test_on_pr.yml
Original file line number Diff line number Diff line change
@@ -1,17 +1,20 @@
name: Test Documentation on PR
on:
pull_request:
branches:
- "main"
- "develop"
- "feature/**"
# any change in the examples folder will trigger check for the corresponding example.
paths:
- 'docs/source/**.md'
- "docs/source/**.md"

jobs:
# This is for changed example files detect and output a matrix containing all the corresponding directory name.
detect-changed-doc:
if: |
github.event.pull_request.draft == false &&
github.base_ref == 'main' &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request'
github.event.pull_request.draft == false &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request'
runs-on: ubuntu-latest
outputs:
any_changed: ${{ steps.changed-files.outputs.any_changed }}
Expand All @@ -26,10 +29,10 @@ jobs:
- name: Locate base commit
id: locate-base-sha
run: |
curBranch=$(git rev-parse --abbrev-ref HEAD)
commonCommit=$(git merge-base origin/main $curBranch)
echo $commonCommit
echo "baseSHA=$commonCommit" >> $GITHUB_OUTPUT
curBranch=$(git rev-parse --abbrev-ref HEAD)
commonCommit=$(git merge-base origin/main $curBranch)
echo $commonCommit
echo "baseSHA=$commonCommit" >> $GITHUB_OUTPUT

- name: Get all changed example files
id: changed-files
Expand All @@ -43,10 +46,9 @@ jobs:
check-changed-doc:
# Add this condition to avoid executing this job if the trigger event is workflow_dispatch.
if: |
github.event.pull_request.draft == false &&
github.base_ref == 'main' &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request' &&
needs.detect-changed-doc.outputs.any_changed == 'true'
github.event.pull_request.draft == false &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request' &&
needs.detect-changed-doc.outputs.any_changed == 'true'
name: Test the changed Doc
needs: detect-changed-doc
runs-on: [self-hosted, gpu]
Expand All @@ -61,8 +63,8 @@ jobs:
- name: Checkout ColossalAI-Documentation
uses: actions/checkout@v2
with:
path: './ColossalAI-Documentation'
repository: 'hpcaitech/ColossalAI-Documentation'
path: "./ColossalAI-Documentation"
repository: "hpcaitech/ColossalAI-Documentation"

- name: Install Docer
run: |
Expand Down
26 changes: 14 additions & 12 deletions .github/workflows/example_check_on_pr.yml
Original file line number Diff line number Diff line change
@@ -1,17 +1,20 @@
name: Test Example on PR
on:
pull_request:
branches:
- "main"
- "develop"
- "feature/**"
# any change in the examples folder will trigger check for the corresponding example.
paths:
- 'examples/**'
- "examples/**"

jobs:
# This is for changed example files detect and output a matrix containing all the corresponding directory name.
detect-changed-example:
if: |
github.event.pull_request.draft == false &&
github.base_ref == 'main' &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request'
github.event.pull_request.draft == false &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request'
runs-on: ubuntu-latest
outputs:
matrix: ${{ steps.setup-matrix.outputs.matrix }}
Expand All @@ -26,10 +29,10 @@ jobs:
- name: Locate base commit
id: locate-base-sha
run: |
curBranch=$(git rev-parse --abbrev-ref HEAD)
commonCommit=$(git merge-base origin/main $curBranch)
echo $commonCommit
echo "baseSHA=$commonCommit" >> $GITHUB_OUTPUT
curBranch=$(git rev-parse --abbrev-ref HEAD)
commonCommit=$(git merge-base origin/main $curBranch)
echo $commonCommit
echo "baseSHA=$commonCommit" >> $GITHUB_OUTPUT

- name: Get all changed example files
id: changed-files
Expand Down Expand Up @@ -61,10 +64,9 @@ jobs:
check-changed-example:
# Add this condition to avoid executing this job if the trigger event is workflow_dispatch.
if: |
github.event.pull_request.draft == false &&
github.base_ref == 'main' &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request' &&
needs.detect-changed-example.outputs.anyChanged == 'true'
github.event.pull_request.draft == false &&
github.event.pull_request.base.repo.full_name == 'hpcaitech/ColossalAI' && github.event_name == 'pull_request' &&
needs.detect-changed-example.outputs.anyChanged == 'true'
name: Test the changed example
needs: detect-changed-example
runs-on: [self-hosted, gpu]
Expand Down
Loading