Re: comparison of large data sets



I try to describe whole situation.
I have 5 systems with similar amount of data.
One of them is a master, so I want to check differences between master
and each of the others. There must be 4 comparisons.
So Master system takes a part in 4 comparisons.
I have flat files of exported data from each systems. I'm not able to
access the data directly from databases.
Report is executed once in a month. Every month I get new data, which
partly are different, especially states.


> Yes a database will likely do it faster than a simple standalone
> program. A good choice would be (if available) one of the better DBMS
> products like ORACLE, DB2, SQL Server, FIrebird/Interbase, and a few
> others. THey all should be able to handle this volume of data.

I do some tests now.
It is difficult to persuade the customers to buy oracle database, so I
try something open source now:)


>
> >
> > Do you have any point of view, what shell I use to do this
>
> If you really are using shell scripts, I'd say avoid those. If you
> really want to "script" this, use PERL.

I was thinking of Perl, or python:)
I wonder if it should be faster to do it in database or use sort
algorithms implemented in C or Perl.

.



Relevant Pages

  • Re: Replication problems
    ... An error occurred while accessing the DHCP database. ... DHCP server event log for more information on this error. ... browser for the domain on transport NetBT_Tcpip_{99B638B5-2C0B. ... master browser is stopping or an election is being forced. ...
    (microsoft.public.windows.server.active_directory)
  • RE: SQL Server failing to start (directly effecting sales!!!) Error: 823, Severity:
    ... You should consider calling Microsoft Technical Support as your master ... They will want your SQL Server Errorlogs ... that the machine with the database has hardware that is failing. ... Even if you are planning to reinstall you should make a backup copy ...
    (microsoft.public.sqlserver.msde)
  • Re: Table Normalization
    ... To master it requires a great deal more work. ... There are so many categories to work in (database design, queries, forms, ... that they'd flub a number of the answers, ... make the same mistake twice, and they don't let others make the same ...
    (microsoft.public.access.tablesdbdesign)
  • RE: Multi user database
    ... manner and push updates to the backend. ... the other user updates table c then run the append query. ... Right now I can determine when the master table is updated, ... the database they should 'share' the data not update to/from local copies. ...
    (microsoft.public.access.modulesdaovba)
  • Copy FE of DB when it is already open on the server
    ... I just recently changed my database that I'm running from a monolithic ... They click a shortcut, which used to lead to the old ... network (the master copy that Tony's program copies out to each user) ... by itself (as apposed to putting it in the same folder with the ...
    (comp.databases.ms-access)