Semblance
A Data-Driven Similarity Kernel on Probability Spaces
Description
We present a rank-based Mercer kernel to compute a pair-wise similarity metric corresponding to informative representation of data. We tailor the development of a kernel to encode our prior knowledge about the data distribution over a probability space. The philosophical concept behind our construction is that objects whose feature values fall on the extreme of that feature’s probability mass distribution are more similar to each other, than objects whose feature values lie closer to the mean. Semblance emphasizes features whose values lie far away from the mean of their probability distribution. The kernel relies on properties empirically determined from the data and does not assume an underlying distribution. The use of feature ranks on a probability space ensures that Semblance is computational efficacious, robust to outliers, and statistically stable, thus making it widely applicable algorithm for pattern analysis. The output from the kernel is a square, symmetric matrix that gives proximity values between pairs of observations.
Downloads
145
Last 30 days
21995th
145
Last 90 days
145
Last year
CRAN Check Status
Show all 14 flavors
| Flavor | Status |
|---|---|
| r-devel-linux-x86_64-debian-clang | NOTE |
| r-devel-linux-x86_64-debian-gcc | NOTE |
| r-devel-linux-x86_64-fedora-clang | OK |
| r-devel-linux-x86_64-fedora-gcc | OK |
| r-devel-macos-arm64 | OK |
| r-devel-windows-x86_64 | OK |
| r-oldrel-macos-arm64 | NOTE |
| r-oldrel-macos-x86_64 | NOTE |
| r-oldrel-windows-x86_64 | NOTE |
| r-patched-linux-x86_64 | OK |
| r-release-linux-x86_64 | OK |
| r-release-macos-arm64 | OK |
| r-release-macos-x86_64 | OK |
| r-release-windows-x86_64 | OK |
Check details (14 non-OK)
CRAN incoming feasibility
Maintainer: ‘Divyansh Agarwal <divyansh@upenn.edu>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("Divyansh", "Agarwal", "Nancy", "R.", "Zhang"),
family = "<nzh@wharton.upenn.edu>",
role = "aut",
email = "nzh@wharton.upenn.edu"),
person(given = "Divyansh",
family = "Agarwal",
role = "cre",
email = "divyansh@upenn.edu"))
as necessary.
CRAN incoming feasibility
Maintainer: ‘Divyansh Agarwal <divyansh@upenn.edu>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("Divyansh", "Agarwal", "Nancy", "R.", "Zhang"),
family = "<nzh@wharton.upenn.edu>",
role = "aut",
email = "nzh@wharton.upenn.edu"),
person(given = "Divyansh",
family = "Agarwal",
role = "cre",
email = "divyansh@upenn.edu"))
as necessary.
*
*
*
*
LazyData
'LazyData' is specified without a 'data' directory
LazyData
'LazyData' is specified without a 'data' directory
LazyData
'LazyData' is specified without a 'data' directory
*
*
*
*
*
Check History
NOTE 9 OK · 5 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 9, 2026
CRAN incoming feasibility
Maintainer: ‘Divyansh Agarwal <divyansh@upenn.edu>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("Divyansh", "Agarwal", "Nancy", "R.", "Zhang"),
family = "<nzh@wharton.upenn.edu>",
role = "aut",
email = "nzh@wharton.upenn.edu"),
person(given = "Divyansh",
family = "Agarwal",
role = "cre",
email = "divya
CRAN incoming feasibility
Maintainer: ‘Divyansh Agarwal <divyansh@upenn.edu>’
No Authors@R field in DESCRIPTION.
Please add one, modifying
Authors@R: c(person(given = c("Divyansh", "Agarwal", "Nancy", "R.", "Zhang"),
family = "<nzh@wharton.upenn.edu>",
role = "aut",
email = "nzh@wharton.upenn.edu"),
person(given = "Divyansh",
family = "Agarwal",
role = "cre",
email = "divya
LazyData
'LazyData' is specified without a 'data' directory
LazyData
'LazyData' is specified without a 'data' directory
LazyData
'LazyData' is specified without a 'data' directory