How should I approach this large data processing task?
I have a set of records containing 250 million website URLs, each with an IP address, page title, country name, server banner (e.g. "Apache"), response time (in ms), number of images and so on. At the moment, these records are in a 25GB flat file.
I'm interested in generating several statistics from this file, such as:
- number of IP addresses represented per country
- average response time per country
- number of images vs. response time
etc etc.
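To make the kind of statistic concrete, here is a minimal single-pass sketch in Python. It assumes, purely for illustration, that the flat file is tab-separated with the columns in the order url, ip, title, country, server, response_ms, image_count; the real layout may well differ, and the function name `country_stats` is hypothetical.

```python
import csv
from collections import defaultdict


def country_stats(lines):
    """Stream records once and aggregate per country.

    Assumed (hypothetical) layout: tab-separated columns
    url, ip, title, country, server, response_ms, image_count.
    """
    ips = defaultdict(set)          # country -> distinct IP addresses seen
    total_ms = defaultdict(float)   # country -> summed response time in ms
    count = defaultdict(int)        # country -> number of records

    # QUOTE_NONE so stray quote characters in page titles are left alone.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        _url, ip, _title, country, _server, response_ms, _images = row
        ips[country].add(ip)
        total_ms[country] += float(response_ms)
        count[country] += 1

    return {
        c: {
            "distinct_ips": len(ips[c]),
            "avg_response_ms": total_ms[c] / count[c],
        }
        for c in count
    }
```

It could be called as `country_stats(open("records.tsv", newline="", encoding="utf-8"))`, where `records.tsv` is a placeholder name. The running sums stay small, but the per-country sets of distinct IPs grow with the data, so at 250 million rows that part may not fit comfortably in memory.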
My question is: how would you approach this type and scale of processing, and what platform and tools would you use (to get results in reasonable time)?
I am open to all suggestions, from MS SQL on Windows to something running on Solaris, all suggestions are welcome :-) Bonus points for DRY (don't repeat yourself): I'd prefer not to write new code each time a different cut is required.
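By DRY I mean something along these lines: one routine where a new cut is just a different pair of column names. This is only a sketch under an assumed layout (tab-separated, with the hypothetical column names in `FIELDS`); `average_by` is an illustrative name, not an existing tool.

```python
from collections import defaultdict

# Hypothetical column names for the assumed tab-separated layout.
FIELDS = ("url", "ip", "title", "country", "server", "response_ms", "image_count")


def average_by(lines, key_field, value_field):
    """Average one numeric column grouped by another, in a single pass."""
    k = FIELDS.index(key_field)
    v = FIELDS.index(value_field)
    total = defaultdict(float)  # group key -> sum of the value column
    count = defaultdict(int)    # group key -> number of records
    for line in lines:
        cols = line.rstrip("\r\n").split("\t")
        total[cols[k]] += float(cols[v])
        count[cols[k]] += 1
    return {key: total[key] / count[key] for key in count}
```

With this shape, `average_by(f, "country", "response_ms")` gives the average response time per country and `average_by(f, "image_count", "response_ms")` gives response time against number of images, without writing new code for each cut.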
Any comments on what works, and what should be avoided, would be greatly appreciated.