A more technical post about how I end up efficiently JOINING 2 datasets with REGEX using a custom UDF in SPARK