Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
0ca0313
Add compile-time constants (macros) for the MR and NR block sizes.
devinamatthews Jan 31, 2022
88dd4d6
PoC implementation of reference gemm kernel.
devinamatthews Feb 5, 2022
ac920eb
Simplify cntx_t structure.
devinamatthews Feb 6, 2022
ccf99db
Missed `bli_kernel_defs_zen3.h` somehow.
devinamatthews Feb 6, 2022
eebe527
Fix two ARM issues.
devinamatthews Feb 7, 2022
87b76c1
Merge remote-tracking branch 'origin/master' into mr_nr_macros_phase1
devinamatthews Feb 7, 2022
84af20d
Re-write reference packing kernels to make use of MR/NR values when a…
devinamatthews Feb 9, 2022
68ffb78
Fix missing `_ROW_PREF`s.
devinamatthews Feb 11, 2022
9f9700b
Merge cosmetic changes and fix packm_1er kernel.
devinamatthews Feb 12, 2022
0b86e10
New packing kernels completely working.
devinamatthews Feb 16, 2022
9baa417
Merge branch 'master' into mr_nr_macros
devinamatthews Feb 16, 2022
034d88c
Fix one last bug.
devinamatthews Feb 16, 2022
2dd8b38
Partial addition of `const` to all interfaces above the level of the …
devinamatthews Mar 7, 2022
80c132a
Finished adding const, now debugging.
devinamatthews Mar 15, 2022
97c5beb
Fix TRMM/TRSM bug.
devinamatthews Mar 15, 2022
479757c
Merge branch 'master' into const-roundup
devinamatthews Mar 15, 2022
1f119e4
Fix performance bug.
devinamatthews Mar 15, 2022
97d124a
Fix performance bug.
devinamatthews Mar 15, 2022
1200d8c
Fix trsm bug.
devinamatthews Mar 15, 2022
99e4c80
Attempt to resolve some conflicts before merge.
fgvanzee Apr 5, 2022
08aea6a
Merge branch 'master' into mr_nr_macros
fgvanzee Apr 5, 2022
9533519
Fixed some stale bli_cntx_get_l1v_ker_dt() calls.
fgvanzee Apr 5, 2022
2998bce
Renamed BLIS_IS_KERNEL; whitespace changes.
fgvanzee Apr 6, 2022
f316e16
Merge branch 'mr_nr_macros' into const-roundup
fgvanzee Apr 8, 2022
94252b0
Remove const from typed unpackm kernel cntx_t parameter.
devinamatthews Apr 8, 2022
7d4a046
Moved stale zen2 copy of bli_axpyf_zen_int_5.c.
fgvanzee Apr 8, 2022
bd71fce
Backup.
devinamatthews May 23, 2022
aa35c75
Merge remote-tracking branch 'upstream/master' into plugins
devinamatthews Jul 13, 2022
23fd6a2
Add `#line` directives to flattened `blis.h`.
devinamatthews Jul 13, 2022
e44c123
Comment updates.
fgvanzee Jul 25, 2022
16336ce
De-template the sup_var1n2m code and combine A/B packing functions.
devinamatthews Sep 7, 2022
eda8d3d
Merge branch 'master' into thrinfo_changes
devinamatthews Sep 7, 2022
580e0b8
Merge remote-tracking branch 'upstream/plugins' into thrinfo_changes2
devinamatthews Sep 29, 2022
7a8a58d
Fix some bugs in bli_pool.c
devinamatthews Sep 29, 2022
989d282
Add an --enable-asan option.
devinamatthews Sep 29, 2022
aef6656
Reinstate sanity check in bli_pool_finalize.
devinamatthews Oct 3, 2022
8d3a84c
Fix memory leak in ukr tests.
devinamatthews Oct 3, 2022
6219fb7
Merge branch 'master' into thrinfo_changes2
devinamatthews Oct 3, 2022
4b48a63
Fix threading bugs in TRSM.
devinamatthews Oct 5, 2022
054a774
Merge remote-tracking branch 'upstream/master' into thrinfo_changes2
devinamatthews Oct 5, 2022
92c1bf4
Export needed public symbols
devinamatthews Oct 5, 2022
3289553
Fix the gemmlike sandbox.
devinamatthews Oct 6, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion addon/gemmd/bao_gemmd_bp_var1.c
Original file line number Diff line number Diff line change
Expand Up @@ -458,7 +458,7 @@ void PASTECH2(bao_,ch,varname) \
/* This barrier is needed to prevent threads from starting to pack
the next row panel of B before the current row panel is fully
computed upon. */ \
bli_thread_barrier( rntm, thread_pb ); \
bli_thread_barrier( thread_pb ); \
} \
} \
\
Expand Down
4 changes: 2 additions & 2 deletions addon/gemmd/bao_l3_packm_a.c
Original file line number Diff line number Diff line change
Expand Up @@ -61,7 +61,7 @@ void PASTECH2(bao_,ch,opname) \
\
/* Barrier to make sure all threads are caught up and ready to begin the
packm stage. */ \
bli_thread_barrier( rntm, thread ); \
bli_thread_barrier( thread ); \
\
/* Compute the size of the memory block eneded. */ \
siz_t size_needed = sizeof( ctype ) * m_pack * k_pack; \
Expand Down Expand Up @@ -319,7 +319,7 @@ void PASTECH2(bao_,ch,opname) \
); \
\
/* Barrier so that packing is done before computation. */ \
bli_thread_barrier( rntm, thread ); \
bli_thread_barrier( thread ); \
}

//INSERT_GENTFUNC_BASIC0( packm_a )
Expand Down
4 changes: 2 additions & 2 deletions addon/gemmd/bao_l3_packm_b.c
Original file line number Diff line number Diff line change
Expand Up @@ -61,7 +61,7 @@ void PASTECH2(bao_,ch,opname) \
\
/* Barrier to make sure all threads are caught up and ready to begin the
packm stage. */ \
bli_thread_barrier( rntm, thread ); \
bli_thread_barrier( thread ); \
\
/* Compute the size of the memory block eneded. */ \
siz_t size_needed = sizeof( ctype ) * k_pack * n_pack; \
Expand Down Expand Up @@ -319,7 +319,7 @@ void PASTECH2(bao_,ch,opname) \
); \
\
/* Barrier so that packing is done before computation. */ \
bli_thread_barrier( rntm, thread ); \
bli_thread_barrier( thread ); \
}

//INSERT_GENTFUNC_BASIC0( packm_b )
Expand Down
Loading