Broadcast
variables are variables that are shared throughout the cluster. Broadcast
variables need to be able to slot in memory on one machine. which means that
they mustn't be something super massive, sort of a massive table or large
vector. Secondly, broadcast variables area cannot be changed, which means that
they can't be modified.