[Tech] Re: [adcom] Database Merge Duplicates Program nearly done

Jamie O'Keefe jokeefe at jamesokeefe.org
Tue Oct 23 22:51:51 EDT 2007


We have three sources of data:

1.  A database of our past contributors
2.  A database of our voters
3.  A database of past supporters

All of these were gotten through sweat and toil of current and past
GRP activists.

The Party codes are:

F = Rainbow Coalition Party designation
G = Green Party USA designation
J  = Green-Rainbow Party

I don't know if 2959 Democrats contributed to us at this time.  I also
don't know how many of these records are still accurately listed as
Democrats.  We need to update them.

Thanks for your questions.  I hope I have answered them well.

Jamie

On 10/23/07, Larry Ely <tetrahedrons at crocker.com> wrote:
> Jamie,
>
> Could you please contextualize this report a bit more?  What is the
> database you started with?  How did you get it?  What did it cost?  What do
> you mean by supporter records?  Are supporters those persons who have
> contributed monetarily to GRP,  those persons who have voted GRP, or
> both?  Can you please tell us what the F, G, and J designations stand
> for?  Under D, you list 2959.  Does this mean that 2959 Democrats have
> contributed money to the GRP?
>
> Thanks for your work on this, which is so essential to getting ourselves
> further organized.
>
> Larry
>
>
> At 03:40 AM 10/21/07, Jamie O'Keefe wrote:
>
> >I have a working program to merge duplicates from the effort to
> >combine all supporter records into one database.  Dan will be happy to
> >know that each new record notes its voter id and contributor db id.
> >The whole process took about a minute to run.
> >
> >I need to correct the phone number matching and we need to finish
> >reviewing the names for errors, but I am hopeful that it will be
> >finished by tomorrow night.
> >
> >I started with 61856 records and after duplicates were merged, 36182
> >records were left.  We haven't corrected all of the names, so there
> >are more duplicates to be found.  With this uncorrected data here are
> >some stats:
> >
> >Address info
> >
> >Bad Address     408
> >Updated Add     5967
> >Other Addr.     27000+
> >
> >Party breakdown
> >
> >F       192
> >G       1267
> >J       8580
> >D       2959
> >R       112
> >U       2468
> >
> >Note that this only has the latest F/G/Js.  We have note combed
> >through the voter database to correct anyone's record who might have
> >moved out of state or changed party.
> >
> >email info
> >
> >email   9222
> >blank   26960
> >
> >Anyway, this is great progress that I hope the campaigns will be able
> >to use soon.  Once we have merged the duplicate records, I will load
> >them into our web db and then give out logins to the campaigns.
> >
> >peace,
> >
> >Jamie
> >_______________________________________________
> >AdCom mailing list
> >AdCom at green-rainbow.org
> >http://www.green-rainbow.org/mailman/listinfo/adcom
> >To email Administration Committee members: adcom.members AT green-rainbow
> >DOT org
>
>
> _______________________________________________
> AdCom mailing list
> AdCom at green-rainbow.org
> http://www.green-rainbow.org/mailman/listinfo/adcom
> To email Administration Committee members: adcom.members AT green-rainbow DOT org
>


-- 
peace,

Jamie
--
James O'Keefe
www.jamesokeefe.org


More information about the Tech mailing list