You are not logged in.

#1 2009-12-14 22:08:42

Kitty
Member
From: The Burning Desert
Registered: 2008-01-11
Posts: 88

icanhascheezburger.com image scraper

icanhascheezburger.com is one of my favorite sites for lolcat image macros. But the site is a horrid mess that takes forever to load. The RSS feed is Sage is even worse. So I decided to fix that.

This is the simple script that scrapes only the pictures from the feed, for easy viewing.

$ cat bin/lolcat-feed.sh
#!/bin/sh

curl --silent http://feeds.feedburner.com/ICanHasCheezburger | sed -e '\%^<img.*http://icanhascheezburger.files.wordpress.com.*jpg%!d'

Which outputs a html list of the pictures, which you can then save to a local file to view with your browser.
I have it set up in a cron job like so:

$ crontab -l
#m h  dom mon dow   command
45 *  *   *   *  $HOME/bin/lolcat-feed.sh > /dev/shm/lolcat-feed.html

Lolcats are much happier now.


/etc/rc.d/ is where daemons reside. Beware.

Offline

#2 2009-12-15 13:26:35

arch0r
Member
From: From the Chron-o-John
Registered: 2008-05-13
Posts: 597

Re: icanhascheezburger.com image scraper

hrhr great! smile

Offline

#3 2009-12-15 13:38:56

Dirk Sohler
Member
From: Hamburg, Germany
Registered: 2009-10-03
Posts: 109

Re: icanhascheezburger.com image scraper

wget $(wget http://icanhascheezburger.com/ -qO - | grep -m1 wordpress.com/files | sed s/".*\(http:\/\/.*\.jpg\).*"/"\1"/g) -qO - | display -window root -

This does the same, but sets the image as wallpaper. One might want to edit the parameters for display for setting the wallpaper centered or something smile

Last edited by Dirk Sohler (2009-12-15 13:39:08)

Offline

#4 2009-12-15 15:18:19

Gen2ly
Member
From: Sevierville, TN
Registered: 2009-03-06
Posts: 1,529
Website

Re: icanhascheezburger.com image scraper

Nice.


Setting Up a Scripting Environment | Proud donor to wikipedia - link

Offline

Board footer

Powered by FluxBB