Skip to content

Commit 2cf2631

Browse files
committed
update readme
1 parent 770f5af commit 2cf2631

2 files changed

Lines changed: 8 additions & 6 deletions

File tree

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@ FlagEmbedding focuses on retrieval-augmented LLMs, consisting of the following p
3838

3939
- **Inference**: [Embedder](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/inference/embedder), [Reranker](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/inference/reranker)
4040
- **Finetune**: [Embedder](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/finetune/embedder), [Reranker](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/finetune/reranker)
41-
- **Evaluation**: [MTEB](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation/mteb), [BEIR](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation/beir), [MSMARCO](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation/msmarco), [MIRACL](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation/miracl), [MLDR](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation/mldr), [MKQA](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation/mkqa), [AIR-Bench](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation/air_bench)
41+
- **Evaluation**: [MTEB](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#1-mteb), [BEIR](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#2-beir), [MSMARCO](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#3-msmarco), [MIRACL](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#4-miracl), [MLDR](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#5-mldr), [MKQA](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#6-mkqa), [AIR-Bench](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#7-air-bench), [Custom Dataset](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/examples/evaluation#8-custom-dataset)
4242
- **[Dataset](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/dataset)**: [MLDR](https://huggingface.co/datasets/Shitao/MLDR), [bge-m3-data](https://huggingface.co/datasets/Shitao/bge-m3-data), [public-data](https://huggingface.co/datasets/cfli/bge-e5data), [full-data](https://huggingface.co/datasets/cfli/bge-full-data), [reranker-data](Shitao/bge-reranker-data)
4343
- **[Tutorials](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/Tutorials)**
4444
- **research**:
@@ -144,7 +144,7 @@ The following contents are releasing in the upcoming weeks:
144144
</details>
145145

146146

147-
## Projects
147+
## [Projects](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagembedding-v1/research)
148148

149149
### BGE-M3 ([Paper](https://arxiv.org/pdf/2402.03216.pdf), [Code](https://github.com/FlagOpen/FlagEmbedding/tree/master/FlagEmbedding/BGE_M3))
150150

@@ -176,7 +176,7 @@ More details please refer to our [paper](https://arxiv.org/abs/2401.03462) and [
176176

177177

178178
### [LM-Cocktail](https://github.com/FlagOpen/FlagEmbedding/tree/master/LM_Cocktail)
179-
179+
180180
LM-Cocktail automatically merges fine-tuned models and base model using a simple function to compute merging weights.
181181
LM-Cocktail can be used to improve the performance on target domain without decrease the general capabilities beyond target domain,
182182
as well as generate a model for new tasks without fine-tuning.

examples/finetune/reranker/README.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -75,9 +75,11 @@ See [example_data](https://github.com/hanhainebula/FlagEmbedding/tree/new-flagem
7575
python add_reranker_score.py \
7676
--input_file toy_finetune_data_minedHN.jsonl \
7777
--output_file toy_finetune_data_score.jsonl \
78-
--range_for_sampling 2-200 \
79-
--negative_number 15 \
80-
--use_gpu_for_searching
78+
--reranker_name_or_path BAAI/bge-reranker-v2-m3 \
79+
--devices cuda:0 cuda:1 \
80+
--cache_dir ./cache/model \
81+
--reranker_query_max_length 512 \
82+
--reranker_max_length 1024
8183
```
8284

8385
- `input_file`: path to save JSON data with mined hard negatives for finetuning

0 commit comments

Comments
 (0)