pairwiseLLM

Pairwise Comparison Tools for Large Language Model-Based Writing Evaluation

v1.1.0 · Dec 22, 2025 · MIT + file LICENSE

Description

Provides a unified framework for generating, submitting, and analyzing pairwise comparisons of writing quality using large language models (LLMs). The package supports live and/or batch evaluation workflows across multiple providers ('OpenAI', 'Anthropic', 'Google Gemini', 'Together AI', and locally-hosted 'Ollama' models), includes bias-tested prompt templates and a flexible template registry, and offers tools for constructing forward and reversed comparison sets to analyze consistency and positional bias. Results can be modeled using Bradley–Terry (1952) <doi:10.2307/2334029> or Elo rating methods to derive writing quality scores. For information on the method of pairwise comparisons, see Thurstone (1927) <doi:10.1037/h0070288> and Heldsinger & Humphry (2010) <doi:10.1007/BF03216919>. For information on Elo ratings, see Clark et al. (2018) <doi:10.1371/journal.pone.0190393>.

Downloads

534

Last 30 days

7037th

534

Last 90 days

534

Last year

CRAN Check Status

14 OK

Show all 14 flavors

Flavor	Status	Time
r-devel-linux-x86_64-debian-clang	OK	133.5s
r-devel-linux-x86_64-debian-gcc	OK	84.1s
r-devel-linux-x86_64-fedora-clang	OK	207.2s
r-devel-linux-x86_64-fedora-gcc	OK	205.1s
r-devel-macos-arm64	OK	39s
r-devel-windows-x86_64	OK	156s
r-oldrel-macos-arm64	OK	43s
r-oldrel-macos-x86_64	OK	231s
r-oldrel-windows-x86_64	OK	196s
r-patched-linux-x86_64	OK	120.2s
r-release-linux-x86_64	OK	118.6s
r-release-macos-arm64	OK	42s
r-release-macos-x86_64	OK	277s
r-release-windows-x86_64	OK	154s

Check details (14 non-OK)

OK r-devel-linux-x86_64-debian-clang

OK r-devel-linux-x86_64-debian-gcc

OK r-devel-linux-x86_64-fedora-clang

OK r-devel-linux-x86_64-fedora-gcc

OK r-devel-macos-arm64

OK r-devel-windows-x86_64

OK r-oldrel-macos-arm64

OK r-oldrel-macos-x86_64

OK r-oldrel-windows-x86_64

OK r-patched-linux-x86_64

OK r-release-linux-x86_64

OK r-release-macos-arm64

OK r-release-macos-x86_64

OK r-release-windows-x86_64

Check History

OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 9, 2026

Dependency Network

Version History

new 1.1.0 Mar 9, 2026

Maintainer

Sterett H. Mercer

Dependencies

Depends

R (>= 4.1)

Imports

curl dplyr httr2 jsonlite rlang stats tibble tidyselect tools utils

Suggests

BradleyTerry2 EloChoice knitr mockery purrr readr rmarkdown sirt stringr testthat (>= 3.0.0) tidyr withr

Compilation

No compilation needed

First Published

Mar 9, 2026

RSS Feed

CRAN Checks

View on CRAN →