Yekun's Note

An Introduction to Capsules

Posted on 2020-04-23 In NN , Capsule Disqus:

A capsule is defined as a group of neuron instantiations whose parameters represent specific properties of a specific type of entity. Here is a brief note of Capsule networks^[1]^[2].

Decoding in Text Generation

Posted on 2020-04-21 In NLP , Conditional LM , Decoding Disqus:

Summary of common decoding strategies in language generation.

Sparse Matrix in Data Processing

Posted on 2020-04-03 Edited on 2022-05-16 In Programming practical , Numerical computation Disqus:

It is wasteful to store zeros elements in a sparse matrix, especially for incrementally data. When constructing tf-idf and bag-of-words features or saving graph ajacent matrix, non-efficient sparse matrix storage might lead to the memory error. To circumvent this problems, efficient sparse matrix storage is a choice.

Clustering Methods: A Note

Posted on 2020-03-20 In ML , Clustering Disqus:

Notes of clustering approaches.

An Introduction to Graph Neural Networks

Posted on 2020-03-16 In Graph Neural Networks Disqus:

Graph Neural Networks (GNNs) has demonstrated efficacy on non-Euclidean data, such as social media, bioinformatics, etc.

Image source: ^[1]

Generative Adversarial Networks

Posted on 2020-01-15 In Unsupervised learning , GAN Disqus:

GANs are widely applied to estimate generative models without any explicit density function, which instead take the game-theoretic approach: learn to generate from training distribution via 2-player games.

Variational Autoencoders

Posted on 2020-01-09 In Unsupervised learning , VAE Disqus:

This is a concise introduction of Variational Autoencoder (VAE).

An Introduction to Bloom Filter

Posted on 2019-12-22 In Algorithms , Bloom filter Disqus:

When we check and filter out the duplicates for a web crawler, bloom filter is a good choice to curtail the memory cost. Here is a brief introduction.

Likelihood-based Generative Models II: Flow Models

Posted on 2019-12-22 In Unsupervised learning , Likelihood-based models , Flow models Disqus:

Flow models are used to learn continuous data.

Likelihood-based Generative Models I: Autoregressive Models

Posted on 2019-12-17 In Unsupervised learning , Likelihood-based models , Autoregressive models Disqus:

The brain has about 10¹⁴ synapses and we only live for about 10⁹ seconds. So we have a lot more parameters than data. This motivates the idea that we must do a lot of unsupervised learning since the perceptual input (including proprioception) is the only place we can get 10⁵ dimensions of constraint per second.
(Geoffrey Hinton)