Works citing this work

3 works

Work: Efficient self-attention with smart pruning for sustainable large language models