{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# SVI Part II: Conditional Independence, Subsampling, and Amortization\n", "\n", "## The Goal: Scaling SVI to Large Datasets\n", "\n", "For a model with $N$ observations, running the `model` and `guide` and constructing the ELBO involves evaluating log pdf's whose complexity scales badly with $N$. This is a problem if we want to scale to large datasets. Luckily, the ELBO objective naturally supports subsampling provided that our model/guide have some conditional independence structure that we can take advantage of. For example, in the case that the observations are conditionally independent given the latents, the log likelihood term in the ELBO can be approximated with\n", "\n", "$$ \\sum_{i=1}^N \\log p({\\bf x}_i | {\\bf z}) \\approx \\frac{N}{M}\n", "\\sum_{i\\in{\\mathcal{I}_M}} \\log p({\\bf x}_i | {\\bf z}) $$\n", "\n", "where $\\mathcal{I}_M$ is a mini-batch of indices of size $M$ with $M    \n", "Matthew D. Hoffman, David M. Blei, Chong Wang, John Paisley\n", "\n", "[2] `Auto-Encoding Variational Bayes`,
    \n", "Diederik P Kingma, Max Welling" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.6.10" } }, "nbformat": 4, "nbformat_minor": 2 }