| 1 |
< |
=== Here I was asked to group dup documents using a field where they are putting in the |
| 2 |
< |
dup values === |
| 1 |
> |
=== Here I was asked to group dup documents using a field where they are putting in the dup values === |
| 2 |
> |
== DO NOT USE. Moved updated code to standalone prog called PerligoDupeGroup.py == |
| 3 |
|
== First I made the matrix, flatening the data into “groups” == |
| 4 |
|
>>> matrix = {} |
| 5 |
< |
>>> contents = open(r"/Users/ninoborges/Dropbox/Misc/359.txt").readlines() |
| 5 |
> |
>>> contents = open(r"C:\Users\eborges\Box Sync\Client\Honeywell\Perligo\20150109_dedupeProj\NearDupesExport.dat").readlines() |
| 6 |
|
>>> matrix2 = {} |
| 7 |
|
>>> count = 1 |
| 8 |
|
>>> for line in contents: |