index
:
ben/bitsandbytes.git
main
fork of github.com/TimDettmers/bitsandbytes, trying to package it via nix
ben
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
csrc
/
kernels.cu
Age
Commit message (
Collapse
)
Author
2022-11-20
Added additional blocksizes: {64, 128, 256}.
Tim Dettmers
2022-11-07
Fixed bug in cpu quant; faster GPU dequant.
Tim Dettmers
2022-11-06
Added blocksizes 2048, 1024, and 512 to blockwise quant.
Tim Dettmers
2022-08-16
Added fused bias in dequant_mm.
Tim Dettmers
2022-08-16
Removed storage() from get_ptr; added boilerplate for bias dequant_mm.
Tim Dettmers
2022-07-26
Merge branch 'patch_merge' into extract_outliers
Tim Dettmers
2022-07-26
Added col_ampere outlier extraction kernel.
Tim Dettmers
2022-07-26
Working outlier extraction for Turing.
Tim Dettmers
2022-07-26
Boilerplate and test for extract_outliers.
Tim Dettmers
2022-07-25
Some progress on build script; added multi-cuda install script.
Tim Dettmers
2022-07-22
Fixed rowcol synchronization bug.
Tim Dettmers
2022-07-22
Most tests passing.
Tim Dettmers
2021-11-28
Added AdamW. #10 #13
Tim Dettmers
2021-11-10
Added adagrad with tests (no clipping).
Tim Dettmers
2021-10-21
Added compilation from source instructions; easier compilation.
Tim Dettmers
2021-10-20
Added skip_zeros; tests are passing.
Tim Dettmers
2021-10-20
Initial plumbing for skip_zeros.
Tim Dettmers
2021-10-05
Initial commit
Tim Dettmers