Codesota · Models · RankLLaMA-7BCastorini (Waterloo)1 results · 1 benchmarks
Model card

RankLLaMA-7B.

Castorini (Waterloo)open-source7B paramsLLaMA-2-7B (pointwise reranker)

Fine-Tuning LLaMA for Multi-Stage Text Retrieval. arXiv 2310.08319.

§ 01 · Card

Model card,
inline.

Rendered server-side from the upstream README on Hugging Face — same content as the source repo, with editorial typography. The full card, sample weights, and revision history live on HF.


Source
castorini/rankllama-v1-7b-lora-passage
License
llama2

RankLLaMA-7B-Passage

Fine-Tuning LLaMA for Multi-Stage Text Retrieval. Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin, arXiv 2023

This model is fine-tuned from LLaMA-2-7B using LoRA for passage reranking.

Training Data

The model is fine-tuned on the training split of MS MARCO Passage Ranking datasets for 1 epoch. Please check our paper for details.

Usage

Below is an example to compute the similarity score of a query-passage pair

python
import torch from transformers import AutoModelForSequenceClassification, AutoTokenizer from peft import PeftModel, PeftConfig def get_model(peft_model_name): config = PeftConfig.from_pretrained(peft_model_name) base_model = AutoModelForSequenceClassification.from_pretrained(config.base_model_name_or_path, num_labels=1) model = PeftModel.from_pretrained(base_model, peft_model_name) model = model.merge_and_unload() model.eval() return model # Load the tokenizer and model tokenizer = AutoTokenizer.from_pretrained('meta-llama/Llama-2-7b-hf') model = get_model('castorini/rankllama-v1-7b-lora-passage') # Define a query-passage pair query = "What is llama?" title = "Llama" passage = "The llama is a domesticated South American camelid, widely used as a meat and pack animal by Andean cultures since the pre-Columbian era." # Tokenize the query-passage pair inputs = tokenizer(f'query: {query}', f'document: {title} {passage}', return_tensors='pt') # Run the model forward with torch.no_grad(): outputs = model(**inputs) logits = outputs.logits score = logits[0][0] print(score)

Batch inference and training

An unofficial replication of the inference and training code can be found here

Citation

If you find our paper or models helpful, please consider cite as follows:

@article{rankllama,
      title={Fine-Tuning LLaMA for Multi-Stage Text Retrieval}, 
      author={Xueguang Ma and Liang Wang and Nan Yang and Furu Wei and Jimmy Lin},
      year={2023},
      journal={arXiv:2310.08319},
}
Card content reproduced from huggingface.co/castorini/rankllama-v1-7b-lora-passage under the upstream license. Rendering trims fenced HTML, raw widgets and tables for safety; tap the link for the untouched original.
§ 02 · Benchmarks

Every benchmark RankLLaMA-7B has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01MS MARCONatural Language Processing · Text Rankingmrr@1041.8%#1/42023-10-12source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 03 · Strengths by area

Where RankLLaMA-7B actually performs.

Natural Language Processing
1
benchmark
avg rank #1.0
§ 04 · Papers

1 paper with results for RankLLaMA-7B.

  1. 2023-10-12· Natural Language Processing· 1 result

    Fine-Tuning LLaMA for Multi-Stage Text Retrieval

§ 05 · Related models

Other Castorini (Waterloo) models scored on Codesota.

MonoT5-3B
3B params · 0 results
§ 06 · Sources & freshness

Where these numbers come from.

arxiv
1
result
1 of 1 rows marked verified.