Added support for the similar names exclusions in this version.
First working version of the build_similar_names function. Some QC was done and it does appear to be working.
Added return_person_all_name_variations and added the additional name fields, now that they were approved. This seems to work well. I also added a version of build_similar_names, which I'm still debugging.
Adding support for expansion of international email address domains, for client email addresses, because we noticed that international client addresses usually have domain aliases for other international domains.
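A minimal sketch of what this kind of domain expansion could look like. The function name `expand_email_domains`, the `DOMAIN_ALIASES` table, and the example domains are all hypothetical; the actual alias mapping is not recorded in this log.

```python
# Hypothetical alias groups: every domain in a group is treated as
# interchangeable. The real mapping is an assumption, not from the source.
DOMAIN_ALIASES = [
    {"example.com", "example.co.uk", "example.de"},
]


def expand_email_domains(address, alias_groups=DOMAIN_ALIASES):
    """Return the normalized address plus one variant per aliased domain."""
    local, sep, domain = address.strip().lower().rpartition("@")
    if not sep:
        # Not an email address; hand it back unchanged.
        return [address]
    variants = {f"{local}@{domain}"}
    for group in alias_groups:
        if domain in group:
            variants.update(f"{local}@{alias}" for alias in group)
    return sorted(variants)
```

Addresses whose domain is not in any alias group come back as a single-element list, so callers can treat both cases the same way.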
Added return_str_string2, which is a more complicated version of the original method. I kept the original in there, but I'm using this updated method when ExportFullSTRList is called.
Adjusting the caps on the return_str_string method.
Added methods for partial matches and sloppy searches.
Adds a new integrity check that will look for email addresses outside of email address fields in the MAL. Also updated the existing email address integrity check to do a better job of finding duplicates across work email and alt email.
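A rough sketch of the "email address outside of an email field" check, assuming each MAL row is available as a dict of column name to value. The helper name `find_stray_emails`, the simplified regex, and the dict layout are assumptions for illustration; only the two email field names appear elsewhere in this log.

```python
import re

# Deliberately simple pattern; good enough to flag candidates for review.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Columns where an email address is expected.
EMAIL_FIELDS = {"work_email_address", "alt_work_email_address"}


def find_stray_emails(rows):
    """Return (row_index, field, address) for addresses in non-email fields."""
    findings = []
    for i, row in enumerate(rows):
        for field, value in row.items():
            if field in EMAIL_FIELDS or not isinstance(value, str):
                continue
            for address in EMAIL_RE.findall(value):
                findings.append((i, field, address))
    return findings
```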
This version adds a RowNumerIntegrityCheck but I'm actually doing this a different way now, separate from this lib, so I may just remove this.
Updated the dates_as_counsel attribute so that it's now a list of two-value tuples, instead of just a raw string. This parsed version will work much better when I attempt to determine if split role attorneys will fall within one of those ranges. Also updated the search_by_email method to search both the work_email_address and the alt_work_email_address, exiting as soon as it finds the first match. There shouldn't be any duplicates, including the alt email addresses, across rows.
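A sketch of parsing a raw dates string into a list of two-value tuples and testing a date against the ranges. The raw format shown here (semicolon-separated `MM/DD/YYYY - MM/DD/YYYY` ranges, with an empty end date meaning "still counsel") is an assumption, as are the function names; the actual MAL format is not described in this log.

```python
from datetime import date, datetime


def parse_dates_as_counsel(raw, fmt="%m/%d/%Y"):
    """Parse 'start - end; start - end' into [(start_date, end_date), ...]."""
    ranges = []
    for chunk in raw.split(";"):
        chunk = chunk.strip()
        if not chunk:
            continue
        start_text, _, end_text = chunk.partition("-")
        start = datetime.strptime(start_text.strip(), fmt).date()
        end_text = end_text.strip()
        # Assumption: an open-ended range means the role is ongoing.
        end = datetime.strptime(end_text, fmt).date() if end_text else date.max
        ranges.append((start, end))
    return ranges


def was_counsel_on(ranges, day):
    """True if day falls inside any (start, end) range, inclusive."""
    return any(start <= day <= end for start, end in ranges)
```

With the ranges stored as tuples of `date` objects, the split role check becomes a plain comparison instead of string handling at every call site.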
This version adds the method RunMalEmailAddressIntegrityCheck which will test and report on email address duplicates across rows and tabs.
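For illustration, a duplicate check across rows and tabs might look roughly like the following. This is not the body of RunMalEmailAddressIntegrityCheck; the function name `find_duplicate_emails` and the `{tab_name: [row dicts]}` input shape are assumptions.

```python
from collections import defaultdict

EMAIL_FIELDS = ("work_email_address", "alt_work_email_address")


def find_duplicate_emails(tabs):
    """Map each address seen more than once to its (tab, row_index, field) locations."""
    seen = defaultdict(list)
    for tab, rows in tabs.items():
        for i, row in enumerate(rows):
            for field in EMAIL_FIELDS:
                address = (row.get(field) or "").strip().lower()
                if address:
                    seen[address].append((tab, i, field))
    # Keep only addresses that occur in more than one place.
    return {a: locs for a, locs in seen.items() if len(locs) > 1}
```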
In this pretty substantial update, I added support for adding information from the split role attorney tab. I also added methods for searching by the unique row number in the spreadsheets, searching by UUID, and searching by any field, which returns a list of matches instead of exiting after finding the first value. This version adds both the unique row number from the spreadsheet and the dates_as_counsel fields to the dataclass.
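A small sketch of the "search by any field, return a list" idea. The `MalRecord` dataclass and `search_by_field` helper are hypothetical stand-ins; the real dataclass has more fields than are listed in this log.

```python
from dataclasses import dataclass, field


@dataclass
class MalRecord:
    # Hypothetical subset of the real dataclass fields.
    row_number: int
    uuid: str
    full_name: str
    work_email_address: str = ""
    alt_work_email_address: str = ""
    dates_as_counsel: list = field(default_factory=list)


def search_by_field(records, field_name, value):
    """Return every record whose field matches value (case-insensitive)."""
    wanted = str(value).strip().lower()
    return [
        r for r in records
        if str(getattr(r, field_name, "")).strip().lower() == wanted
    ]
```

Returning a list rather than the first hit lets the caller decide whether multiple matches are an error or expected.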
Finally changed this so that it's a library, so I removed the run code at the bottom. Programs will now be able to call this library now that the pickle will work properly.
They changed a column name to 'comments', so I updated the dataclass. Also added some warning messages for when the Consilio override ingestion can't find a matching email address in the MAL. I'm seeing many of these.
This version fixes a bug where it wasn't importing the downgrades tab because of an error in missing logic. Also adds a pseudo-private method that does some smarter deduplication using a set, because I wanted to make sure I only had unique addresses in the initial RE result and to treat anything else as an error. If I have 3 email addresses where only one should exist (like in the From field) but they are all the same email address, I want to make sure I'm not raising a warning because they are not unique. If there is more than one unique address, that is a warning. Finally, also added support for providing a warning if the override name has a different last name from the MAL for the same record. I'm now using Consilio values for my override, but there is concern they could be completely wrong. So far I'm actually finding them more accurate and not totally off. However, I still want to be warned.
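The set-based deduplication described above can be sketched as follows. The helper names `_unique_addresses` and `check_single_address` and the warning text are hypothetical, not the names used in the library.

```python
def _unique_addresses(addresses):
    """Collapse repeats, ignoring case and surrounding whitespace."""
    return {a.strip().lower() for a in addresses if a and a.strip()}


def check_single_address(addresses):
    """Return a warning string if more than one distinct address is present."""
    unique = _unique_addresses(addresses)
    if len(unique) > 1:
        return f"WARNING: expected one address, found {len(unique)}: {sorted(unique)}"
    # Zero or one distinct address: repeats of the same value are not a problem.
    return None
```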
The last updated values from the MAL were giving me issues when trying to save out a pickle file. I think it's actually reading these as a datetime obj, which is why I get this error. For now I'm going to comment this out and will add it back in once I have a reason to use it.
Added support for the "full name overide" list because I'm seeing lots of inconsistencies in the MAL. I need a way to override in some instances based on email address. Also updated for the new 12.1 MAL, which added a column (last updates), changed a column name, and added more rows to both tabs.
This version adds some stripping because some values in the MAL have leading or trailing whitespace.
Added support to do the actual compares and output the issue values to a log file, further refining the logic. Also added some support for manually adjusting some of the values that are inconsistent in the MAL.
This program will assist with the process of performing Names Normalization QC on the Amazon privilege logs.