Skip to content

Commit 9ae4be0

Browse files
committed
update doc
1 parent 56e2c8e commit 9ae4be0

2 files changed

Lines changed: 3 additions & 3 deletions

File tree

Long_LLM/activation_beacon/new/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Activation-Beacon
22

3-
[Activation Beacon](https://arxiv.org/abs/2401.03462) compresses the original KV into fewer yet more compact states (a.k.a. beacons) and hence enabling the LLM to perceive longer context given its fixed context window. It is known for the following features:
3+
[Activation Beacon](https://arxiv.org/abs/2401.03462) compresses the original KV into fewer yet more compact states (a.k.a. beacons) and hence enables the LLM to perceive longer context given its fixed context window. It is known for the following features:
44
- **Effective**
55
- there is little information loss given a compression ratio of 2, 4, and 8;
66
- **Efficient**

Long_LLM/activation_beacon/new/docs/training.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -131,8 +131,8 @@ torchrun --nproc_per_node 8 $DDP -m main.train \
131131
--min_length 7200 \
132132
--group_by_stride strict \
133133
--enable_beacon \
134-
--beacon_window 2048 \
135-
--beacon_stride 2048 \
134+
--beacon_window 1024 \
135+
--beacon_stride 1024 \
136136
--beacon_attn full-coverage \
137137
--beacon_attend_prev True \
138138
--beacon_sink_size 1 \

0 commit comments

Comments
 (0)