Package: PsychWordVec 2025.3

PsychWordVec: Word Embedding Research Framework for Psychological Science

An integrative toolbox of word embedding research that provides: (1) a collection of 'pre-trained' static word vectors in the '.RData' compressed format <https://psychbruce.github.io/WordVector_RData.pdf>; (2) a group of functions to process, analyze, and visualize word vectors; (3) a range of tests to examine conceptual associations, including the Word Embedding Association Test <doi:10.1126/science.aal4230> and the Relative Norm Distance <doi:10.1073/pnas.1720347115>, with permutation test of significance; and (4) a set of training methods to locally train (static) word vectors from text corpora, including 'Word2Vec' <doi:10.48550/arXiv.1301.3781>, 'GloVe' <doi:10.3115/v1/D14-1162>, and 'FastText' <doi:10.48550/arXiv.1607.04606>.

Authors:Han-Wu-Shuang Bao [aut, cre]

PsychWordVec_2025.3.tar.gz
PsychWordVec_2025.3.zip(r-4.5)PsychWordVec_2025.3.zip(r-4.4)PsychWordVec_2025.3.zip(r-4.3)
PsychWordVec_2025.3.tgz(r-4.5-any)PsychWordVec_2025.3.tgz(r-4.4-any)PsychWordVec_2025.3.tgz(r-4.3-any)
PsychWordVec_2023.9.tar.gz(r-4.5-noble)PsychWordVec_2025.3.tar.gz(r-4.4-noble)
PsychWordVec_2025.3.tgz(r-4.4-emscripten)PsychWordVec_2025.3.tgz(r-4.3-emscripten)
PsychWordVec.pdf |PsychWordVec.html✨
PsychWordVec/json (API)
NEWS

# Install 'PsychWordVec' in R:

install.packages('PsychWordVec', repos = c('https://psychbruce.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/psychbruce/psychwordvec/issues

Pkgdown site:https://psychbruce.github.io

Datasets:

demodata - Demo data (pre-trained using word2vec on Google News; 8000 vocab, 300 dims).

On CRAN:

4.04 score 22 stars 10 scripts 397 downloads 29 exports 181 dependencies

Last updated 2 days agofrom:660631c3e6. Checks:8 OK, 1 FAILURE. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 30 2025
R-4.5-win	OK	Mar 30 2025
R-4.5-mac	OK	Mar 30 2025
R-4.5-linux	OUTDATED	Feb 22 2025
R-4.4-win	OK	Mar 30 2025
R-4.4-mac	OK	Mar 30 2025
R-4.4-linux	OK	Mar 30 2025
R-4.3-win	OK	Mar 30 2025
R-4.3-mac	OK	Mar 30 2025

Exports:as_embed as_wordvec cc cos_dist cos_sim cosine_similarity data_transform data_wordvec_load data_wordvec_subset dict_expand dict_reliability get_wordvec load_embed load_wordvec most_similar normalize orth_procrustes pair_similarity pattern plot_network plot_similarity plot_wordvec plot_wordvec_tSNE sum_wordvec tab_similarity test_RND test_WEAT tokenize train_wordvec

Dependencies:abind afex askpass backports base64enc bayestestR bit bit64 boot broom broom.mixed bruceR bslib cachem car carData cellranger checkmate cli clipr cluster coda codetools colorspace corpcor corrplot cowplot cpp11 crayon curl data.table datawizard Deriv digest doBy dplyr effectsize emmeans estimability evaluate fansi farver fastmap fastTextR fdrtool float fontawesome forcats foreign Formula fs furrr future generics ggplot2 ggrepel glasso globals glue GPArotation gridExtra gtable gtools haven highr Hmisc hms htmlTable htmltools htmlwidgets httr igraph insight interactions isoband jpeg jquerylib jsonlite jtools knitr labeling lattice lavaan lgr lifecycle listenv lme4 lmerTest lpSolve magrittr MASS Matrix MatrixExtra MatrixModels mediation memoise mgcv microbenchmark mime minqa mlapi mnormt modelr munsell mvtnorm nlme nloptr nnet numDeriv openssl pander parallelly parameters pbapply pbivnorm pbkrtest performance pillar pkgconfig plyr png prettyunits progress psych purrr qgraph quadprog quantreg R.methodsS3 R.oo R.utils R6 rappdirs rbibutils RColorBrewer Rcpp RcppArmadillo RcppEigen RcppProgress Rdpack readr readxl reformulas rematch reshape2 rgl RhpcBLASctl rio rlang rmarkdown rpart rsparse rstudioapi Rtsne sandwich sass scales slam SparseM stringi stringr survival sys texreg text2vec tibble tidyr tidyselect tinytex tzdb utf8 vctrs viridis viridisLite vroom withr word2vec writexl xfun yaml zoo

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics
Word vectors data class: 'wordvec' and 'embed'.	as_embed as_wordvec pattern [.embed
Cosine similarity/distance between two vectors.	cosine_similarity cos_dist cos_sim
Transform plain text of word vectors into 'wordvec' (data.table) or 'embed' (matrix), saved in a compressed ".RData" file.	data_transform
Load word vectors data ('wordvec' or 'embed') from ".RData" file.	data_wordvec_load load_embed load_wordvec
Extract a subset of word vectors data (with S3 methods).	data_wordvec_subset subset.embed subset.wordvec
Demo data (pre-trained using word2vec on Google News; 8000 vocab, 300 dims).	demodata
Expand a dictionary from the most similar words.	dict_expand
Reliability analysis and PCA of a dictionary.	dict_reliability
Extract word vector(s).	get_wordvec
Find the Top-N most similar words.	most_similar
Normalize all word vectors to the unit length 1.	normalize
Orthogonal Procrustes rotation for matrix alignment.	orth_procrustes
Compute a matrix of cosine similarity/distance of word pairs.	pair_similarity
Visualize a (partial correlation) network graph of words.	plot_network
Visualize cosine similarity of word pairs.	plot_similarity
Visualize word vectors.	plot_wordvec
Visualize word vectors with dimensionality reduced using t-SNE.	plot_wordvec_tSNE
Calculate the sum vector of multiple words.	sum_wordvec
Tabulate cosine similarity/distance of word pairs.	tab_similarity
Relative Norm Distance (RND) analysis.	test_RND
Word Embedding Association Test (WEAT) and Single-Category WEAT.	test_WEAT
Tokenize raw text for training word embeddings.	tokenize
Train static word embeddings using the Word2Vec, GloVe, or FastText algorithm.	train_wordvec