Thesis docs
|
Normalized Generalized Gamma Process class for Bayesian nonparametric clustering. More...
#include <NGGP.hpp>
Public Member Functions | |
NGGP (Data &d, Params &p) | |
Constructor for the Normalized Generalized Gamma Process. | |
Gibbs Sampling Methods | |
double | gibbs_prior_existing_cluster (int cls_idx, int obs_idx=0) const override |
Computes the log prior probability of assigning a data point to an existing cluster. | |
Eigen::VectorXd | gibbs_prior_existing_clusters (int obs_idx) const override |
Computes the log prior probabilities of assigning a data point to every existing cluster. This method is useful for Gibbs sampling over existing clusters. It returns a vector of log prior probabilities for all existing clusters. | |
double | gibbs_prior_new_cluster () const override |
Computes the log prior probability of assigning a data point to a new cluster. | |
Split-Merge Algorithm Methods | |
double | prior_ratio_split (int ci, int cj) const override |
Computes the prior ratio for a split operation in an NGGP-based split-merge MCMC algorithm. | |
double | prior_ratio_merge (int size_old_ci, int size_old_cj) const override |
Computes the prior ratio for a merge operation in an NGGP-based split-merge MCMC algorithm. | |
double | prior_ratio_shuffle (int size_old_ci, int size_old_cj, int ci, int cj) const override |
Computes the prior ratio for a shuffle operation in an NGGP-based split-merge MCMC algorithm. | |
Parameter Update Methods | |
void | update_params () override |
Updates the NGGP parameters by updating the latent variable U. | |
Accessor Methods | |
double | get_U () const |
Gets the current value of the latent variable U. | |
int | get_accepted_U () const |
Gets the number of accepted U updates for monitoring convergence. | |
![]() | |
Process (Data &d, Params &p) | |
Constructor initializing process with data and parameters. | |
void | set_old_allocations (const Eigen::VectorXi &new_allocations) |
Store current allocations for potential rollback. | |
void | set_idx_i (int i) |
Set index of first observation in split-merge pair. | |
void | set_idx_j (int j) |
Set index of second observation in split-merge pair. | |
virtual | ~Process () |
Virtual destructor for proper cleanup of derived classes. | |
Additional Inherited Members | |
![]() | |
Data & | data |
Reference to the data object containing observations and allocations. | |
Params & | params |
Reference to the parameters object containing process hyperparameters. | |
Eigen::VectorXi | old_allocations |
Storage for previous allocations to enable rollback in case of rejection. | |
int | idx_i |
Index of first observation involved in split-merge move. | |
int | idx_j |
Index of second observation involved in split-merge move. | |
const double | log_a = log(params.a) |
Precomputed logarithm of total mass parameter for efficiency. | |
Normalized Generalized Gamma Process class for Bayesian nonparametric clustering.
This class implements a Normalized Generalized Gamma Process (NGGP) that extends the Dirichlet Process by incorporating a latent variable U. The NGGP provides more flexibility in modeling cluster sizes and incorporates adaptive behavior through the U parameter.
|
inline |
Gets the number of accepted U updates for monitoring convergence.
|
inline |
Gets the current value of the latent variable U.
|
nodiscardoverridevirtual |
Computes the log prior probability of assigning a data point to an existing cluster.
For NGGP, this incorporates the discount parameter sigma, giving probability proportional to (n_k - sigma) where n_k is the cluster size.
cls_idx | The index of the cluster. |
obs_idx | The index of the observation (default: 0, unused in this implementation). |
Computes the log prior probability of assigning a data point to an existing cluster.
For NGGP, this incorporates the discount parameter sigma, giving probability proportional to (n_k - sigma) where n_k is the cluster size.
cls_idx | The index of the cluster. |
obs_idx | The index of the observation (unused in this implementation). |
Implements Process.
|
nodiscardoverridevirtual |
Computes the log prior probabilities of assigning a data point to every existing cluster. This method is useful for Gibbs sampling over existing clusters. It returns a vector of log prior probabilities for all existing clusters.
obs_idx | The index of the observation to assign. |
Computes the log prior probabilities of assigning a data point to all existing clusters.
This method incorporates spatial information by considering the number of neighbors in each target cluster when computing the prior probabilities.
obs_idx | The index of the observation to assign. |
Implements Process.
|
nodiscardoverridevirtual |
Computes the log prior probability of assigning a data point to a new cluster.
For NGGP, this depends on the latent variable U and is proportional to alpha * sigma * (tau + U)^sigma.
Computes the log prior probability of assigning a data point to a new cluster.
For NGGP, this depends on the latent variable U and is proportional to alpha * sigma * (tau + U)^sigma.
Implements Process.
|
nodiscardoverridevirtual |
Computes the prior ratio for a merge operation in an NGGP-based split-merge MCMC algorithm.
This method accounts for the generalized gamma process prior when computing the acceptance ratio for merging clusters.
size_old_ci | The size of the first cluster before the merge. |
size_old_cj | The size of the second cluster before the merge. |
Computes the prior ratio for a merge operation in an NGGP-based split-merge MCMC algorithm.
This method accounts for the generalized gamma process prior when computing the acceptance ratio for merging clusters.
size_old_ci | The size of the first cluster before the merge. |
size_old_cj | The size of the second cluster before the merge. |
Implements Process.
|
nodiscardoverridevirtual |
Computes the prior ratio for a shuffle operation in an NGGP-based split-merge MCMC algorithm.
This method accounts for the generalized gamma process prior when computing the acceptance ratio for shuffling observations between clusters.
size_old_ci | The size of the first cluster before the shuffle. |
size_old_cj | The size of the second cluster before the shuffle. |
ci | The first cluster index involved in the shuffle. |
cj | The second cluster index involved in the shuffle. |
Computes the prior ratio for a shuffle operation in an NGGP-based split-merge MCMC algorithm.
This method accounts for the generalized gamma process prior when computing the acceptance ratio for shuffling observations between clusters.
size_old_ci | The size of the first cluster before the shuffle. |
size_old_cj | The size of the second cluster before the shuffle. |
ci | The first cluster index involved in the shuffle. |
cj | The second cluster index involved in the shuffle. |
Implements Process.
|
nodiscardoverridevirtual |
Computes the prior ratio for a split operation in an NGGP-based split-merge MCMC algorithm.
This method accounts for the generalized gamma process prior when computing the acceptance ratio for splitting clusters.
ci | The first cluster index involved in the split. |
cj | The second cluster index involved in the split. |
Computes the prior ratio for a split operation in an NGGP-based split-merge MCMC algorithm.
This method accounts for the generalized gamma process prior when computing the acceptance ratio for splitting clusters.
ci | The first cluster index involved in the split. |
cj | The second cluster index involved in the split. |
Implements Process.
|
inlineoverridevirtual |