Save Information to DB at Crawling



I am trying to make a crawling program to grab information and store them
into the database.
The web site is structured as following.

REGION
CATEGORY
PROPERTY LISTING

In the site there are about 50 regions each region has 20 category or less
and at the maximum one category
can be as many as 2000( can display 20 property for each page). In order to
get all information, my crawler is going to each property page using regular
expression
to extract specific info( Price, BR, Contact Info etc)..

I have problem to decide when and where I can save it to database. Note that
this crawler is scheduled to go to the website
to get info every day and if the property information is not changed from
last modification date the crawler is going to skip
the property.

I create the following tables to store those information

Region Table
ID, Region Name

Category Table
ID(1~20), Category Name

Property Table
ID, Category, Name, Address, Price, Contact Info, Bed Rooms, Contact Info,
Location(Lat), Location(Lon)






.



Relevant Pages

  • Re: Save Information to DB at Crawling
    ... The web site is structured as following. ... In order to get all information, my crawler is going to each property page using regular expression ... I have problem to decide when and where I can save it to database. ... than to suggest you take a class on Oracle PL/SQL programming and that, ...
    (comp.databases.oracle.server)
  • Re: file upload storage - database vs folder
    ... Suppose I do store my images for a web site in a BLOB ... > I have used both techniques whilst working with oracle database. ...
    (microsoft.public.dotnet.general)
  • Re: Newbie with questions.
    ... reflects the information i have in the database. ... will be a pretty big web site, the pictures on the database would be thumb ... typically you don't store images in an Access database, but instead store them in a folder and store the "links" to ... you may be better off asking these types of questions in a newsgroup devoted to website programming. ...
    (comp.databases.ms-access)
  • Re: Multi Color DropDownList
    ... Or create a field in the database and store the color name and display ... accordingly on web site. ... Kinda expensive idea since you would have store this information in db. ...
    (microsoft.public.dotnet.general)
  • Re: Loading Default One Time
    ... global.asa file and session and application variables at all? ... > I store default information for a particular web site. ... session rather than quering the database every time. ...
    (microsoft.public.inetserver.asp.general)