Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
73 commits
Select commit Hold shift + click to select a range
bf874bf
Add dpo. Fix sft, ppo, lora. Refactor all
YeAnbang Nov 21, 2023
e5b0d4e
fix and tested ppo
YeAnbang Nov 30, 2023
b96e1a8
2 nd round refactor
YeAnbang Nov 30, 2023
45d75c0
add ci tests
YeAnbang Dec 1, 2023
3422307
fix ci
YeAnbang Dec 4, 2023
1c64eb4
fix ci
YeAnbang Dec 5, 2023
f96a036
fix readme, style
YeAnbang Dec 6, 2023
aed86b7
fix readme style
YeAnbang Dec 6, 2023
e3bc911
fix style, fix benchmark
YeAnbang Dec 6, 2023
fb62add
reproduce benchmark result, remove useless files
YeAnbang Dec 7, 2023
7a09724
rename to ColossalChat
YeAnbang Dec 7, 2023
d49a4b2
use new image
YeAnbang Dec 8, 2023
ae2259c
fix ci workflow
YeAnbang Dec 11, 2023
f7a4327
fix ci
YeAnbang Dec 11, 2023
4e04b7c
use local model/tokenizer for ci tests
YeAnbang Dec 12, 2023
b3d7924
fix ci
YeAnbang Dec 12, 2023
d078f28
fix ci
YeAnbang Dec 12, 2023
43f8f6a
fix ci
YeAnbang Dec 13, 2023
5761c00
fix ci timeout
YeAnbang Dec 13, 2023
9d01e6a
fix rm progress bar. fix ci timeout
YeAnbang Dec 13, 2023
b3ae6e1
fix ci
YeAnbang Dec 14, 2023
bb55cb3
fix ci typo
YeAnbang Dec 14, 2023
10f17db
remove 3d plugin from ci temporary
YeAnbang Dec 14, 2023
0a69a60
test environment
YeAnbang Jan 9, 2024
ebc216c
cannot save optimizer
YeAnbang Jan 16, 2024
d8e2703
support chat template
YeAnbang Jan 19, 2024
68bd55f
fix readme
YeAnbang Jan 19, 2024
911c146
fix path
YeAnbang Jan 19, 2024
95d8b4f
test ci locally
YeAnbang Jan 22, 2024
1432062
restore build_or_pr
YeAnbang Jan 22, 2024
4da39bb
fix ci data path
YeAnbang Jan 22, 2024
e37bd05
fix benchmark
YeAnbang Jan 23, 2024
52179e6
fix ci, move ci tests to 3080, disable fast tokenizer
YeAnbang Jan 23, 2024
fc0ae0c
move ci to 85
YeAnbang Jan 23, 2024
d7cd36a
support flash attention 2
YeAnbang Jan 24, 2024
41d58c0
add all-in-one data preparation script. Fix colossal-llama2-chat chat…
YeAnbang Jan 26, 2024
18eebc9
add hardware requirements
YeAnbang Jan 29, 2024
e891595
move ci test data
YeAnbang Jan 29, 2024
2f93646
merge
YeAnbang Jan 29, 2024
7b437d1
fix save_model, add unwrap
YeAnbang Jan 30, 2024
8f9e9b5
fix missing bos
YeAnbang Feb 2, 2024
6c5086b
fix missing bos; support grad accumulation with gemini
YeAnbang Feb 2, 2024
610e063
merge
YeAnbang Feb 2, 2024
173752b
fix ci
YeAnbang Feb 6, 2024
9e3b9df
fix ci
YeAnbang Feb 6, 2024
5a165ee
fix ci
YeAnbang Feb 6, 2024
6a78285
fix llama2 chat template config
YeAnbang Feb 6, 2024
e813386
debug sft
YeAnbang Feb 6, 2024
56aa260
debug sft
YeAnbang Feb 6, 2024
70cddb0
fix colossalai version requirement
YeAnbang Feb 19, 2024
71f2ca2
Merge branch 'coati_v2' of https://github.com/YeAnbang/ColossalAI int…
YeAnbang Feb 19, 2024
be5e262
fix ci
YeAnbang Feb 20, 2024
1c07166
add sanity check to prevent NaN loss
YeAnbang Feb 20, 2024
c0c6940
Merge branch 'main' of https://github.com/YeAnbang/ColossalAI into co…
YeAnbang Feb 20, 2024
dcafcd3
fix requirements
YeAnbang Feb 20, 2024
4d2e13e
add dummy data generation script
YeAnbang Feb 20, 2024
ef3e4eb
add dummy data generation script
YeAnbang Feb 20, 2024
5c56904
add dummy data generation script
YeAnbang Feb 20, 2024
7db239a
add dummy data generation script
YeAnbang Feb 20, 2024
11bd74d
update readme
TongLi3701 Mar 14, 2024
4144aed
update readme
TongLi3701 Mar 19, 2024
0f8da48
update readme and ignore
TongLi3701 Mar 20, 2024
7dc6244
fix logger bug
TongLi3701 Mar 20, 2024
bb05639
support parallel_output
YeAnbang Mar 22, 2024
4208d61
Merge branch 'main' of https://github.com/YeAnbang/ColossalAI into co…
YeAnbang Mar 22, 2024
63d5a32
Merge branch 'coati_v2' of https://github.com/YeAnbang/ColossalAI int…
YeAnbang Mar 22, 2024
50863ca
modify data preparation logic
YeAnbang Mar 22, 2024
8ab6ceb
fix tokenization
YeAnbang Mar 25, 2024
07e1177
update lr
YeAnbang Mar 25, 2024
b2fa7f6
Merge branch 'coati_v2' of https://github.com/YeAnbang/ColossalAI int…
TongLi3701 Mar 25, 2024
dcddb9b
fix inference
YeAnbang Mar 25, 2024
a66972c
run pre-commit
YeAnbang Mar 25, 2024
a32dde1
merge
YeAnbang Mar 25, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
29 changes: 19 additions & 10 deletions .github/workflows/run_chatgpt_examples.yml
Original file line number Diff line number Diff line change
Expand Up @@ -19,35 +19,44 @@ jobs:
runs-on: [self-hosted, gpu]
container:
image: hpcaitech/pytorch-cuda:2.1.0-12.1.0
options: --gpus all --rm -v /data/scratch/github_actions/chat:/data/scratch/github_actions/chat --shm-size=10.24gb
timeout-minutes: 30
options: --gpus all --rm -v /data/scratch/examples-data:/data/scratch/examples-data --shm-size=10.24gb
timeout-minutes: 60
defaults:
run:
shell: bash
steps:
- name: Checkout ColossalAI
uses: actions/checkout@v2

- name: Install Colossal-AI
run: |
BUILD_EXT=1 pip install -v -e .

- name: Install ChatGPT
run: |
cd applications/Chat
cd applications/ColossalChat
pip install -v .
export BUILD_EXT=1
pip install -r examples/requirements.txt

- name: Install Transformers
run: |
pip install transformers==4.30.2
pip install transformers==4.34.1

- name: Execute Examples
run: |
cd applications/Chat
cd applications/ColossalChat
rm -rf ~/.cache/colossalai
./tests/test_inference.sh
./tests/test_benchmarks.sh
mkdir models
mkdir sft_data
mkdir prompt_data
mkdir preference_data
./tests/test_data_preparation.sh
./tests/test_train.sh
env:
NCCL_SHM_DISABLE: 1
MAX_JOBS: 8
SFT_DATASET: /data/scratch/github_actions/chat/data.json
PROMPT_DATASET: /data/scratch/github_actions/chat/prompts_en.jsonl
PRETRAIN_DATASET: /data/scratch/github_actions/chat/alpaca_data.json
PRETRAINED_MODEL_PATH: ./models
SFT_DATASET: ./sft_data
PROMPT_DATASET: ./prompt_data
PREFERENCE_DATASET: ./preference_data
10 changes: 6 additions & 4 deletions .github/workflows/run_chatgpt_unit_tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@ jobs:
runs-on: [self-hosted, gpu]
container:
image: hpcaitech/pytorch-cuda:2.1.0-12.1.0
options: --gpus all --rm -v /data/scratch/chatgpt:/data/scratch/chatgpt
options: --gpus all --rm -v /data/scratch/examples-data:/data/scratch/examples-data
timeout-minutes: 30
defaults:
run:
Expand All @@ -32,15 +32,17 @@ jobs:

- name: Install ChatGPT
run: |
cd applications/Chat
cd applications/ColossalChat
pip install -v .
pip install -r requirements-test.txt
pip install pytest

- name: Execute Unit Testing
run: |
cd applications/Chat
cd applications/ColossalChat
rm -rf ~/.cache/colossalai
pytest tests/
cd ./tests
./test_templating.sh
env:
NCCL_SHM_DISABLE: 1
MAX_JOBS: 8
4 changes: 4 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -159,3 +159,7 @@ coverage.xml
# ignore testmon and coverage files
.coverage
.testmondata*

# log, test files - ColossalChat
applications/ColossalChat/logs
applications/ColossalChat/tests/logs
38 changes: 0 additions & 38 deletions applications/Chat/benchmarks/README.md

This file was deleted.

208 changes: 0 additions & 208 deletions applications/Chat/benchmarks/benchmark_opt_lora_dummy.py

This file was deleted.

13 changes: 0 additions & 13 deletions applications/Chat/coati/dataset/__init__.py

This file was deleted.

Loading