Window Support Software

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Friday, 8 October 2010

Screen Scraping Content To Excel via Web Connectors

Posted on 15:02 by Unknown
My friends love to tease me for being a "spreadsheet guy" because I make spreadsheets for nearly everything under the sun.  Mostly product comparisons, financial analysis(ish), stuff like that.

One spreadsheet that I work on very frequently deals with 401k fund options from my employer.  Due diligence on 150 funds is a time consuming process.  To help me make semi-educated picks I browse finance.yahoo.com to look at stats such as the beta, expense ratio, manager tenure, turnover, yield, (1, 3, 5, 10) year load adjusted returns, etc.

The problem? These stats change constantly, and manually updating a spreadsheet is a chore!

Recently I was investigating data streams for a keystroke biometric project that I'm working on and came across some information that led me to believe that creating data connections through Excel would be a snap. 

If you open MS Excel 2007+ and go to the Data tab you'll notice there are several external data connection options (Access, Web, Text).

If you jab the "Web" button it'll open your default IE home page within a window with Yellow boxes and black arrows all over.  These boxes identify html table structures.  By selecting one (or more) of these tables you are indicating to the web connector that you'd like to import that content. Click ok and it'll give you the option to import that content into an existing worksheet or a new one. 

Rather than creating unique connections via the UI, you could create and point to saved data connectors (.iqy files).  Click the "Existing Connections" button and you'll uncover 3 examples from MSN Money, 2 of which are templates that once connected allow you to map your variables. 
I've set my connections to update upon file open and that process takes approximately 1.5 minutes for 24 connectors, so just under 4 seconds per record.  I'm assuming that I can extrapolate that number out to 9 minutes for the 150 connectors I'll eventually need (based upon a template using a single .iqy file I wrote).  While I'm not very patient when it comes to waiting for files to open, the reality is that having to do this manually would take me about 2 hours, so 9 minutes doesn't seem all that bad.

I wish Access had a feature where I could schedule the imports! But then again I'm sure there is a more elegant solution to my problem here.  So with that said, if anyone has suggestions for alternate ways to extract mutual fund data for analysis (to a database, XML, excel or whatever) I'd be all ears!
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Posted in Data Connection, Excel, Screen Scraping | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • Deleting AutoComplete Entries In Outlook
    Your Outlook is not just a mere email client; it is a very efficient personal manager tool. MS Outlook can help you take notes, make journal...
  • Interact Intranet: Automate the Extraction of Binary Profile Pictures for use in Active Directory
    Active Directory can be used as a central repository for the storage of profile pictures.  This source can then feed email, CRM, intranets, ...
  • How to Set Password on Your Windows XP
    Windows XP is the most popular operating system in the world. It has got a number of unique features, which make it a favourite to all. It w...
  • Configure Windows Live Or Hotmail Account In Your IPhone
    IPhone lets you configure your email accounts in it, and now, if you have a Hotmail account, we will give you tips on how to configure that ...
  • Steps To Set Up The Email Filter In Outlook
    As we all know, Microsoft Outlook contains some of the most advanced and novel features which help users to remember and organize their impo...
  • Walk-through of domain configuration for Oracle WebCenter 11g: on Windows Server 2008 x64 & MS SQL 2008
    In my previous post I covered the third step of the "simple" installation topology, Install other Fusion Middleware products , and...
  • How To Tackle The Buffer Overflow Situation
    Normally a program runs on a sufficient amount of memory. However, when the program consumes more memory than required, a phenomenon called ...
  • Oracle WebCenter Interaction: Using multiple logins and URLs to access your portal
    When we purchased Plumtree licenses way back in the day we had visions of using the software to run our intranet and as an extranet for smal...
  • Walk-through of post install tasks for Oracle WebCenter: on Windows Server 2008 x64 & MS SQL 2008
    In my previous post I covered the fourth step of the "simple" installation topology, Configure domain for WebCenter , and will now...
  • Excel Tips & Tricks
    I enjoy spreadsheets and working with numbers.  A lot of this joy comes from the satisfaction of being able to keep track of many different ...

Categories

  • 64bit
  • access a help link
  • Active Directory
  • AD
  • Adaptive Layouts
  • Administrator account
  • Antimalware programs
  • binary
  • collaboration
  • content management
  • content migration
  • CSS
  • Data Connection
  • Data execution prevention
  • Ease your work
  • Eclipse
  • Email accounts
  • Email in Outlook
  • End process
  • Excel
  • hosting
  • iCalendar
  • ics
  • IIS
  • install
  • Interact-Intranet
  • Intranet
  • Jive Express
  • Jive SBS
  • Jive Software
  • Knowledge Directory
  • Liferay
  • Liferay Developer Studio
  • Linux
  • Local Computer Policy
  • login
  • Microsoft chat
  • Microsoft Help
  • Microsoft support
  • Microsoft tech support
  • Microsoft word application
  • Minimum hardware requirements
  • Mozilla Firefox web browser
  • MS SQL 2008
  • New Mail Notification
  • Oracle
  • Oracle ECM
  • Oracle Enterprise Content Management
  • Oracle OpenWorld
  • Oracle WebCenter
  • Oracle WebCenter Analytics
  • Oracle WebCenter Analytics 10.3.0.1
  • Oracle WebCenter Interaction
  • Oracle WebCenter Suite
  • Outlook 2007
  • Outlook Express Address Book
  • Outlook Repair
  • Outlook repair utility
  • Outlook settings
  • Outlook support
  • Outlook tech support
  • Password protection
  • Plumtree
  • portals
  • Programs and Features
  • Publisher
  • RCU
  • redirect
  • Remove AVG toolbar
  • Remove malwares
  • Screen Scraping
  • Search
  • set password
  • SharePoint
  • SQL
  • SQL Server Reporting Services
  • Studio
  • Task manager
  • try Windows 8
  • Underline feature
  • vanity URL
  • Vista problems
  • WCI
  • WebCenter Interaction
  • WebCenter Suite
  • WebLogic Server
  • Windows 7 Support
  • Windows 8 Support
  • Windows 8 transfer
  • Windows Blue
  • Windows Easy Transfer tool
  • Windows live chat support
  • Windows Live Photo Gallery
  • Windows Online Support
  • Windows Server 2008
  • Windows support
  • Windows Vista help
  • Windows XP support
  • WLS
  • Xbox 360 controller

Blog Archive

  • ►  2013 (33)
    • ►  December (1)
    • ►  November (1)
    • ►  October (8)
    • ►  August (2)
    • ►  July (4)
    • ►  June (4)
    • ►  May (3)
    • ►  April (2)
    • ►  March (2)
    • ►  February (1)
    • ►  January (5)
  • ►  2012 (32)
    • ►  December (4)
    • ►  November (5)
    • ►  October (2)
    • ►  September (7)
    • ►  August (1)
    • ►  July (1)
    • ►  June (8)
    • ►  May (2)
    • ►  April (1)
    • ►  March (1)
  • ►  2011 (30)
    • ►  November (3)
    • ►  October (2)
    • ►  September (2)
    • ►  August (4)
    • ►  July (3)
    • ►  June (5)
    • ►  May (4)
    • ►  April (2)
    • ►  March (3)
    • ►  January (2)
  • ▼  2010 (8)
    • ►  December (2)
    • ►  November (1)
    • ▼  October (5)
      • Oracle WebCenter Interaction: Getting Around Adapt...
      • Oracle WebCenter Interaction Studio Input Field Bl...
      • Performing a Redirect in Oracle WebCenter Interaction
      • Screen Scraping Content To Excel via Web Connectors
      • Using iCalendar Files to Enhance Content Items in ...
Powered by Blogger.

About Me

Unknown
View my complete profile