ViewVC Help
View File | Revision Log | Show Annotations | View Changeset | Root Listing
root/ns_dev/Python/NinoCode/Active_prgs/Redgrave/Amazon_NamesNormQC.py
Revision 980 - (view) (annotate) - [select for diffs]
Modified Fri Feb 20 18:54:49 2026 UTC (5 weeks, 1 day ago) by nino.borges
File length: 47799 byte(s)
Diff to previous 967
Added support for the similar names exclusions in this version.15
Revision 967 - (view) (annotate) - [select for diffs]
Modified Thu Jan 8 21:27:51 2026 UTC (2 months, 2 weeks ago) by nino.borges
File length: 46354 byte(s)
Diff to previous 966
First working version of the build_similar_names function.  Some QC was done and it does appear to be working.
Revision 966 - (view) (annotate) - [select for diffs]
Modified Wed Jan 7 20:45:17 2026 UTC (2 months, 2 weeks ago) by nino.borges
File length: 46353 byte(s)
Diff to previous 965
Added return_person_all_name_variations and added the additional name fields, now that it was approved.  This seems to work well.  I also added the version version of build_similar_names, which I'm still debugging.
Revision 965 - (view) (annotate) - [select for diffs]
Modified Tue Dec 23 18:20:42 2025 UTC (3 months ago) by nino.borges
File length: 40990 byte(s)
Diff to previous 956
Adding support for expansion of international email address domains, for client email addresses, because we noticed that international client addresses usually have domain aliases for other international domains.
Revision 956 - (view) (annotate) - [select for diffs]
Modified Fri Dec 5 16:50:27 2025 UTC (3 months, 3 weeks ago) by nino.borges
File length: 39285 byte(s)
Diff to previous 944
Added return_str_string2 which is a more complicated version of the original method.  I kept the original in there but I'm using using this more updated method when ExportFullSTRList is called.
Revision 944 - (view) (annotate) - [select for diffs]
Modified Wed Nov 5 18:00:35 2025 UTC (4 months, 3 weeks ago) by nino.borges
File length: 36247 byte(s)
Diff to previous 920
adjusting the caps on the return_str_string method.
Revision 920 - (view) (annotate) - [select for diffs]
Modified Thu Aug 7 20:25:28 2025 UTC (7 months, 2 weeks ago) by nino.borges
File length: 34554 byte(s)
Diff to previous 877
added methods for partial matches and sloppy searches.
Revision 877 - (view) (annotate) - [select for diffs]
Modified Fri Jan 10 22:03:53 2025 UTC (14 months, 2 weeks ago) by nino.borges
File length: 32676 byte(s)
Diff to previous 869
Adds a new integrity check that will look for email addresses outside of email address fields in the MAL.  Also updated the existing email address integrity check to do a better job of finding duplicates across work email and alt email.
Revision 869 - (view) (annotate) - [select for diffs]
Modified Fri Dec 20 16:57:01 2024 UTC (15 months ago) by nino.borges
File length: 30961 byte(s)
Diff to previous 854
This version adds a RowNumerIntegrityCheck but I'm actually doing this a different way now, separate from this lib, so I may just remove this. 
Revision 854 - (view) (annotate) - [select for diffs]
Modified Thu Dec 12 18:26:19 2024 UTC (15 months, 2 weeks ago) by nino.borges
File length: 28072 byte(s)
Diff to previous 853
Updated the dates_as_counsel attribute so that it's now a list of two value tuples, instead of just a raw string.  this parsed version will work much better when I attempt to determine if split role attorneys will fall within one of those ranges.  Also updated search_by_email method to search both the work_email_address and the alt_work_email_address, exiting as soon as it finds the first match.  There shouldnt be any duplicates, including the alt email addresses, across rows.
Revision 853 - (view) (annotate) - [select for diffs]
Modified Thu Dec 12 16:18:41 2024 UTC (15 months, 2 weeks ago) by nino.borges
File length: 26788 byte(s)
Diff to previous 850
This version adds the method RunMalEmailAddressIntegrityCheck which will test and report on email address duplicates across rows and tabs.
Revision 850 - (view) (annotate) - [select for diffs]
Modified Wed Dec 11 15:42:57 2024 UTC (15 months, 2 weeks ago) by nino.borges
File length: 25663 byte(s)
Diff to previous 847
In this pretty substantial update, I added support for the adding information from the split role attorney tab.  I also added methods for searching by the unique row number in the spreadsheets, searching by UUID, and support for searching by any field which results in a list instead of just exiting after finding the first value.  This version adds both the unique row number from the spreadsheet and adds the dates_as_counsel fields to the dataclass.
Revision 847 - (view) (annotate) - [select for diffs]
Modified Tue Dec 10 17:59:30 2024 UTC (15 months, 2 weeks ago) by nino.borges
File length: 23052 byte(s)
Diff to previous 846
Finally changed this so that it's a library, so I removed the run code at the bottom.  Programs will now be able to call this library now that the pickle will work properly.
Revision 846 - (view) (annotate) - [select for diffs]
Modified Tue Dec 10 17:22:55 2024 UTC (15 months, 2 weeks ago) by nino.borges
File length: 23675 byte(s)
Diff to previous 844
They changed a column name to 'comments', so updated dataclass. also added some warning messages when the consilio override ingestion cant find a matching email address in the MAL.  I'm seeing many of these.
Revision 844 - (view) (annotate) - [select for diffs]
Modified Fri Dec 6 21:56:09 2024 UTC (15 months, 2 weeks ago) by nino.borges
File length: 23349 byte(s)
Diff to previous 838
This version fixes a bug where it wasnt importing the downgrades tab because of an error in missing logic.  Also adds a pseudo private method that does some smarter deduplication in a set because I wanted to make sure I only had unique addresses in the initial RE result, looking for that as an error.  If I have 3 email addresses where only one should exist (like in the from) but they are the same email address, I wanted to make sure I wasnt raising a warning because they are not unique.  if there is more than one unique, that is a warning.  Finally also added support for providing a warning if the override name has a different last name from the MAL for the same record.  I'm now using consilio values for my override but there is concern they could be completely wrong.  So far I'm finding them more accurate actually and not totally off. however still want to be warned.
Revision 838 - (view) (annotate) - [select for diffs]
Modified Mon Dec 2 20:03:00 2024 UTC (15 months, 3 weeks ago) by nino.borges
File length: 19183 byte(s)
Diff to previous 837
the last updated values from the MAL were giving me issues when trying to save out a pickle file.  I think it's actually reading these as a datetime obj, which is why I get this error.  For now going to comment this out and will add it back in once I have a reason to use it.
Revision 837 - (view) (annotate) - [select for diffs]
Modified Mon Dec 2 19:18:05 2024 UTC (15 months, 3 weeks ago) by nino.borges
File length: 19099 byte(s)
Diff to previous 836
Added support for the "full name overide" list because I'm seeing lots of inconsistencies in the MAL.  I need a way to overide in some instances based on email address.  Also updated for the new 12.1 MAL, which added a column (last updates), changed a col name and added more rows to both tabs.
Revision 836 - (view) (annotate) - [select for diffs]
Modified Wed Nov 27 16:53:01 2024 UTC (15 months, 4 weeks ago) by nino.borges
File length: 16475 byte(s)
Diff to previous 835
This version added some strip due to values in the MAL having beginning or trailing whitespace.
Revision 835 - (view) (annotate) - [select for diffs]
Modified Wed Nov 27 16:08:34 2024 UTC (15 months, 4 weeks ago) by nino.borges
File length: 16353 byte(s)
Diff to previous 834
Added support to do the actual compares and output the issue values to a log file, further refining the logic.  Also added some support to add some manual manipulation on some of the values for things where inconsistent in the MAL.
Revision 834 - (view) (annotate) - [select for diffs]
Added Tue Nov 26 20:04:02 2024 UTC (16 months ago) by nino.borges
File length: 13814 byte(s)
This program will assist with the process of performing Names Normalization QC on the Amazon privilege logs.

Convenience Links

Links to HEAD: (view) (annotate)

Compare Revisions

This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, enter a numeric revision.

  Diffs between and
  Type of Diff should be a