Software and Code - Computational Biology and Machine Learning (CBML)

Class-Discriminative Attention Maps (CDAM)

Explainable AI method for vision transformer (ViT) to estimate the importance scores of input features with respect to the class or concept.

Concept Saliency Maps (CSM)

Evaluate and visualize latent representation of high-level concepts in generative models, such as variational autoencoders (VAEs).

Reproducible Code

Feature Perturbation Augmentation (FPA)

Data augmentation technique for deep learning training to reduce perturbation artifacts in downstream evaluation of explainable AI methods.

Reproducible Code

Jaccard

Statistical tests of similarity between binary data using the Jaccard/Tanimoto coefficient – the ratio of intersection to union.

Stable R package on CRAN
Dev R package on GitHub

Jackstraw

Statistical methods to evaluate association between variables and their estimated latent variables. Latent variables may be estimated by principal component analysis (PCA), logistic factor analysis (LFA), clustering, and related techniques.

Stable R package on CRAN
Dev R package on GitHub

Tutorials

Association test with Principal Components
Statistical test of cluster memberships with the mtcars example
Unsupervised evaluation of cell identities in single cell genomics

Jaws

Jackstraw weighted shrinkage estimation for high-dimensional latent variable models. The jackstraw is used to estimate sparse loadings (i.e., coefficients) of Principal Component Analysis, Logistic Factor Analysis, and related techniques.

Dev R package