Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Pipeline the load of scales in k-loop
cla signed
fb-exported
#2601
opened May 17, 2024 by
htyu
Loading…
Reorder load and scaling code to allow latency hidding for block-wise scaled GEMMs
cla signed
fb-exported
#2600
opened May 17, 2024 by
htyu
Loading…
use total_L is None for dense_to_jagged
cla signed
fb-exported
#2599
opened May 16, 2024 by
ColinPeppler
Loading…
Add cache conflict miss support (backend)
cla signed
fb-exported
#2596
opened May 16, 2024 by
sryap
Loading…
Print periodic logs in SSD TBE benchmark
cla signed
fb-exported
#2580
opened May 10, 2024 by
pranjalssh
Loading…
Set directory location is SSD TBE benchmarks
cla signed
fb-exported
#2579
opened May 10, 2024 by
pranjalssh
Loading…
all_to_one cuda support non-2d inputs
cla signed
fb-exported
#2575
opened May 9, 2024 by
IvanKobzarev
Loading…
add max norm support to PARTIAL_ROWWISE_ADAM
cla signed
fb-exported
#2567
opened May 7, 2024 by
zainhuda
Loading…
Pyre Configurationless migration for] [batch:9/28]
cla signed
fb-exported
#2557
opened May 3, 2024 by
connernilsen
Loading…
Pyre Configurationless migration for] [batch:6/29]
cla signed
#2548
opened Apr 29, 2024 by
connernilsen
Loading…
Integrate triton row and blockwise fp8 gemm to llm inference.
cla signed
fb-exported
#2547
opened Apr 29, 2024 by
choutim
Loading…
Make CowClipDefinition and CounterBasedRegularizationDefinition hashable
cla signed
#2539
opened Apr 27, 2024 by
csmiler
Loading…
Pyre Configurationless migration for] [batch:6/29]
cla signed
#2538
opened Apr 25, 2024 by
connernilsen
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.