Package: localLLM 1.3.1

Yaosheng Xu

localLLM: Running Local LLMs with 'llama.cpp' Backend

Provides R bindings to the 'llama.cpp' library for running large language models. The package uses a lightweight architecture where the C++ backend library is downloaded at runtime rather than bundled with the package. Package features include text generation, reproducible generation, and parallel inference.

Authors:Eddie Yang [aut], Yaosheng Xu [aut, cre]

localLLM_1.3.1.tar.gz
localLLM_1.3.1.zip(r-4.7)localLLM_1.3.1.zip(r-4.6)localLLM_1.3.1.zip(r-4.5)
localLLM_1.3.1.tgz(r-4.6-x86_64)localLLM_1.3.1.tgz(r-4.6-arm64)localLLM_1.3.1.tgz(r-4.5-x86_64)localLLM_1.3.1.tgz(r-4.5-arm64)
localLLM_1.3.1.tar.gz(r-4.7-arm64)localLLM_1.3.1.tar.gz(r-4.7-x86_64)localLLM_1.3.1.tar.gz(r-4.6-arm64)localLLM_1.3.1.tar.gz(r-4.6-x86_64)
localLLM_1.3.1.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
localLLM/json (API)

# Install 'localLLM' in R:

install.packages('localLLM', repos = c('https://eddieyang211.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/eddieyang211/localllm/issues

Uses libs:

c++– GNU Standard C++ Library v3

Datasets:

ag_news_sample - AG News classification sample

On CRAN:

cpp

7.59 score 9 stars 18 scripts 216 downloads 30 exports 7 dependencies

Last updated from:a017b26b66. Checks:13 OK. Indexed: yes.

Target	Result	Time
linux-devel-arm64	OK	163
linux-devel-x86_64	OK	132
source / vignettes	OK	207
linux-release-arm64	OK	159
linux-release-x86_64	OK	132
macos-release-arm64	OK	186
macos-release-x86_64	OK	287
macos-oldrel-arm64	OK	211
macos-oldrel-x86_64	OK	364
windows-devel	OK	110
windows-release	OK	94
windows-oldrel	OK	96
wasm-release	OK	125

Exports:annotation_sink_csv apply_chat_template apply_gemma_chat_template backend_free backend_init compute_confusion_matrices context_create detokenize document_end document_start download_model explore generate generate_parallel get_lib_path get_model_cache_dir hardware_profile install_localLLM intercoder_reliability lib_is_installed list_cached_models list_ollama_models model_load quick_llama quick_llama_reset set_hf_token smart_chat_template tokenize tokenize_test validate

Dependencies:curl digest jsonlite R.methodsS3 R.oo R.utils Rcpp

Reproducible Output

Rendered fromreproducible-output.Rmdusingknitr::rmarkdown

Last update: 2026-05-05
Started: 2025-12-12

Frequently Asked Questions

Rendered fromfaq.Rmdusingknitr::rmarkdown

Last update: 2026-04-26
Started: 2025-12-12

Get Started with localLLM

Rendered fromget-started.Rmdusingknitr::rmarkdown

Last update: 2026-04-26
Started: 2025-12-12

Parallel Processing

Rendered fromtutorial-parallel-processing.Rmdusingknitr::rmarkdown

Last update: 2026-04-20
Started: 2025-12-12

Basic Text Generation

Rendered fromtutorial-basic-generation.Rmdusingknitr::rmarkdown

Last update: 2026-04-13
Started: 2025-12-12

Model Comparison & Validation

Rendered fromtutorial-model-comparison.Rmdusingknitr::rmarkdown

Last update: 2026-04-06
Started: 2025-12-12

Ollama Integration

Rendered fromtutorial-ollama-integration.Rmdusingknitr::rmarkdown

Last update: 2026-02-24
Started: 2025-12-12

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics
R Interface to llama.cpp with Runtime Library Loading	localLLM-package localLLM
AG News classification sample	ag_news_sample
Create a CSV sink for streaming annotation chunks	annotation_sink_csv
Apply Chat Template to Format Conversations	apply_chat_template
Apply Gemma-Compatible Chat Template	apply_gemma_chat_template
Free localLLM backend	backend_free
Initialize localLLM backend	backend_init
Compute confusion matrices from multi-model annotations	compute_confusion_matrices
Create Inference Context for Text Generation	context_create
Convert Token IDs Back to Text	detokenize
Finish automatic run documentation	document_end
Start automatic run documentation	document_start
Download a model manually	download_model
Compare multiple LLMs over a shared set of prompts	explore
Generate Text Using Language Model Context	generate
Generate Text in Parallel for Multiple Prompts	generate_parallel
Get Backend Library Path	get_lib_path
Get the model cache directory	get_model_cache_dir
Inspect detected hardware resources	hardware_profile
Install localLLM Backend Library	install_localLLM
Intercoder reliability for LLM annotations	intercoder_reliability
Check if Backend Library is Installed	lib_is_installed
List cached models on disk	list_cached_models
List GGUF models managed by Ollama	list_ollama_models
Load Language Model with Automatic Download Support	model_load
Get All GGUF Metadata from a Loaded Model	model_metadata
Quick LLaMA Inference	quick_llama
Reset quick_llama state	quick_llama_reset
Configure Hugging Face access token	set_hf_token
Smart Chat Template Application	smart_chat_template
Convert Text to Token IDs	tokenize
Test tokenize function (debugging)	tokenize_test
Validate model predictions against gold labels and peer agreement	validate