Programming Languages

Waren Long, source{d}.

Programming Languages

Waren Long, source{d}

Hercules Killing the Lernean Hydra

Read this on your device

About me

About source{d}

History

The dataset

The dataset

apache/spark@7026ee23

Quantization

We apply Snowball stemmer to words longer than 6 chars

Quantization

apache/spark@7026ee23

Transportation Problem

apache/spark@7026ee23

EMD as minimum-cost flow

$$ \left\{ \begin{array}{lll} \min & \sum_{i=1}^S \sum_{j=1}^D ~ x_{i,j} c_{i,j} \\\\ s.c. & \sum_{j=1}^D x_{i,j} = 1 & ~~i = 1,...,S \\\\ & \sum_{i=1}^S x_{i,j} = 1 & ~~j = 1,...,D \\\\ & x_{i,j} \geq 0 & ~~i,j = 1,...,S,D \end{array} \right. $$

Transition

Matrix

Importance

"Importance" is defined recursively: we gain ego if we contribute to repositories to which other important developers contribute.

\[x_ {AR} = \frac{w_ 1 x_ 1 + w_ 2 x_ 2 + w_ 3 x_ 3 + ...}{\lambda} = \frac{1}{\lambda} \sum w_ i x_ i\]

We do not distinguish developers and repositories here.

Transition

Matrix Sorted

Language

Competition

Correlation

Matrix

Future

Bonus: blog posts

Thank you