My Christmas Goodies

from the two best kids in the world.

Posted via email from The Soistmann Family

Posting del.icio.us Links to WordPress: Finishing Up

On Wednesday, I posted more info about how to clean up my weekly del.icio.us links. There are a few things I’d like to do before I wrap this up.

  1. change all tags and attributes to lowercase
  2. close every dt element
  3. close every dd element
  4. make things a bit more automatic

If we take a closer look at the code for each entry we will see a pattern.


One line has a <DT> followed by the anchor. The next line has a <DD> followed by my comments.

<DT><A HREF="url" LAST_VISIT="1238086010" ADD_DATE="1238086010" TAGS="tagone,tagtwo">Link text</A>
<DD>comments

The only thing that makes this tricky at all is that sometimes the comments span more than one line. We can get around this fairly easily though. All we need to do is put the closing </dd> before all the <DT> tags except the first one. Let’s make that easier by changing the first one to lowercase. We’ll change part of what we did on Wednesday to accomplish this. Instead of replacing

<DL><p>

with

<dl>

we will replace

<DL><p><DT><A HREF=

with

<dl><dt><a href=

The rest of the cleanup is pretty straightforward.


Replace

<DT><A HREF="

with

</dd><dt><a href="

and

</A>

with

</a></dt>

and then

LAST_VISIT=[^<]*TAGS="

with

tags="

since I don’t need two of those attributes anyway.

And I almost forgot

<DD>

with

<dd>

Wrap it all up and we have

grep '^> ' < links.diff |awk '{sub(/<DL><p><DT><A HREF=/,"<dl><dt><a href=")};{sub(/<\/A>/,"</a></dt>")};{sub(/<DT><A HREF=/,"</dd><dt><a href=")};{sub(/<DD>/,"<dd>")}{sub(/LAST_VISIT[^<]*TAGS=/,"tags=")};{sub(/^> /, "")};!/<\/DL>/{print}' > foo.html;echo "</dl>" >> foo.html
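For readability, the same substitutions can be laid out one per line. This is only a sketch: the function name clean_links is hypothetical, and it reads the diff from standard input instead of links.diff.

```shell
#!/bin/sh
# clean_links (hypothetical name): diff output on stdin, cleaned list markup on stdout.
clean_links() {
  grep '^> ' | awk '
    { sub(/^> /, "") }                                  # drop the diff marker
    { sub(/<DL><p><DT><A HREF=/, "<dl><dt><a href=") }  # first entry opens the list
    { sub(/<DT><A HREF=/, "</dd><dt><a href=") }        # later entries close the previous dd
    { sub(/<\/A>/, "</a></dt>") }                       # close the anchor and its dt
    { sub(/<DD>/, "<dd>") }
    { sub(/LAST_VISIT=[^<]*TAGS=/, "tags=") }           # drop LAST_VISIT and ADD_DATE
    !/<\/DL>/ { print }                                 # skip the closing line of the export
  '
  # As in the one-liner, the final dd is left open here.
  echo "</dl>"
}
```

Run it as clean_links < links.diff > foo.html. Changing the last echo to "</dd></dl>" would also close the final dd.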

All we need now is to make the whole process more automatic. Since we have to add that line break in the old export file we can change things up once again to do that automatically. And since we will probably want to save this as a shell script, we can go ahead and make it more readable. I changed a couple of things I didn’t detail here and this is what I ended up with:

First I generalize a bit so I can change things later if I want to

diff $OLDLINKS $NEWLINKS |grep '^> ' |awk '{sub(/<\/A>/,"</a></dt>")};{sub(/<DL><p><DT><A HREF=/,"<dl><dt><a href=")}{sub(/<DT><A HREF=/,"</dd><dt><a href=")};{sub(/<DD>/,"<dd>")}{sub(/LAST_VISIT[^<]*TAGS=/,"tags=")};{sub(/^> /, "")};!/<\/DL>/{print}' > $MYLINKS;echo "</dl>" >> $MYLINKS

then decide on path names (I like to let Firefox save in Downloads automatically and I’m going to delete the new links file anyway, so I set the pathname accordingly.)

export LINKSDIR=$HOME/Documents/Personal/blogging
export OLDLINKS=$LINKSDIR/old-delicious.htm
export NEWLINKS=$HOME/Downloads/delicious-`date "+%Y%m%d"`.htm
export MYLINKS=$LINKSDIR/mylinks.html

then we make our new links file the old one for next week. We should also add that line break while we’re at it (and remove the new links file)

awk '{sub(/<DL><p>/,"<dl>\n")};{print}' < $NEWLINKS > $OLDLINKS
rm $NEWLINKS

and I like to go ahead and open my links file so I can make any quick edits and then post

mate $MYLINKS
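Put in running order, the pieces above might be assembled something like this. It is a sketch, not the script I actually use: the function name prep_links and its three-argument interface are made up for illustration, and opening the editor is left to the caller.

```shell
#!/bin/sh
# prep_links OLD NEW OUT
# Hypothetical wrapper around the steps above; name and arguments are assumptions.
prep_links() {
  old=$1; new=$2; out=$3

  # Keep only the lines that are new this week and clean up the markup.
  diff "$old" "$new" | grep '^> ' | awk '
    { sub(/<\/A>/, "</a></dt>") }
    { sub(/<DL><p><DT><A HREF=/, "<dl><dt><a href=") }
    { sub(/<DT><A HREF=/, "</dd><dt><a href=") }
    { sub(/<DD>/, "<dd>") }
    { sub(/LAST_VISIT[^<]*TAGS=/, "tags=") }
    { sub(/^> /, "") }
    !/<\/DL>/ { print }
  ' > "$out"
  echo "</dl>" >> "$out"

  # This week's export becomes next week's baseline, with the line break added.
  awk '{ sub(/<DL><p>/, "<dl>\n") } { print }' < "$new" > "$old"
  rm "$new"
}
```

With the variables exported as above, the call would be prep_links "$OLDLINKS" "$NEWLINKS" "$MYLINKS" followed by mate "$MYLINKS".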

I save it, put it in my PATH, and make it executable

sudo mv preplinks /usr/bin/
sudo chmod 755 /usr/bin/preplinks

You can grab the script here and do the same.

Now every week I go to del.icio.us and export my bookmarks as html and then I run

preplinks

and TextMate launches with my html all ready to be checked and posted.

Works for me.

This is the last in a series of posts. The first two posts are here and here.

Cleaning Up My del.icio.us Links

On Monday, I posted some info about how I am thinking of posting my weekly links.

Today I want to make one correction to the process, talk details about how to clean up the diff file, and then put together a quick script to do that part automatically. Once again, I am going to do this for the first time as I write this. I will summarize the process below.

First, the correction. After my first use of this method I discovered that one more quick edit to the html export will make the parsing of the diff file much easier. Before I move ~/delicious.htm to ~/delicious-old.htm I need to add a line break just after <DL><p>. It may not seem like much but it makes a big difference.
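One way to add that line break without opening an editor is a single awk substitution. This is a sketch; the function name add_break is hypothetical.

```shell
#!/bin/sh
# add_break (hypothetical name): copy stdin to stdout, putting a newline
# right after the first <DL><p> found on a line.
add_break() {
  awk '{ sub(/<DL><p>/, "<DL><p>\n") } { print }'
}
```

Instead of the plain mv, the old file can then be written with add_break < ~/delicious.htm > ~/delicious-old.htm.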

Actually, as it turns out, this is fairly easy to do with awk and grep. Let’s take a look at exactly what I want to do first.

I am only interested in lines that start with > and a space so I start with

grep '^> ' < links.diff

I want to replace the <DL> with <dl> and I don’t need the <p> at all. So now I have

grep '^> ' < links.diff |awk '{sub(/<DL><p>/,"<dl>")};{print}'

Now we get rid of the > and the space at the beginning of each line.

grep '^> ' < links.diff |awk '{sub(/<DL><p>/,"<dl>")};{sub(/^> /, "")};{print}'

Then we don’t print the last line at all.

grep '^> ' < links.diff |awk '{sub(/<DL><p>/,"<dl>")};{sub(/^> /, "")};!/<\/DL>/{print}'

This gives me everything I need but I still have uppercase tags and attributes, some attributes I don’t really care about, and none of the elements are closed. We can take care of closing the <dl> with a simple echo "</dl>" after it.

echo "</dl>"

So, if we want to save all this to a file we can do this.

grep '^> ' < links.diff |awk '{sub(/<DL><p>/,"<dl>")};{sub(/^> /, "")};!/<\/DL>/{print}' > foo.html;echo "</dl>" >> foo.html
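To see what this stage produces without a real export, the same pipeline can be wrapped so it reads from standard input. The wrapper name wednesday_clean is hypothetical and the sample link in the usage note is made up.

```shell
#!/bin/sh
# wednesday_clean (hypothetical name): the pipeline above, reading stdin
# instead of links.diff and writing to stdout instead of foo.html.
wednesday_clean() {
  grep '^> ' | awk '{sub(/<DL><p>/,"<dl>")};{sub(/^> /, "")};!/<\/DL>/{print}'
  echo "</dl>"
}
```

Feeding it the three lines > <DL><p><DT><A HREF="u" TAGS="t">Title</A>, > <DD>note and > </DL><p> gives back <dl><DT><A HREF="u" TAGS="t">Title</A>, then <DD>note, then </dl>. The list is wrapped, but the inner tags are still uppercase and unclosed.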

Now all I need to do is clean up those uppercase letters and close all the other elements. I’ll take a look at that on Friday.

This is the second in a series of posts. The first post is here and the next one is here.

Posting del.icio.us Links Weekly to WordPress

I’ve been using del.icio.us to share links since 2005. I’ve always used another method for bookmarking links for myself, but del.icio.us has been my favorite method for the sharing of interesting links. Before del.icio.us I had a separate linkblog so right away I wanted a way to display my shared links in a similar format. I started out by replacing my linkblog with an html rendering of the RSS feed from del.icio.us. I quickly realized that I wanted more than that so I set the blog back up and used a cron job to auto-post my links to the WP database. I’ve written about all of this before.

After a while, I gave up on the linkblog completely and just used a widget to show the links on my blog. Not quite what I wanted but good enough for a while. Recently I decided to set up the daily blog posting feature that del.icio.us provides. This is a very nice feature but doesn’t work well for me because my links come in waves. So, I turned that off earlier this week and set off in search of a way to post the links as a weekly roundup. I’ve seen other sites do this and I like it a lot.

After a few quick searches, I didn’t find anything I thought was worth spending time fooling with. It seems to me it’s just as easy to come up with something on my own. As a hacker I would prefer something as automatic as possible, but I don’t mind having to do something manually. I will probably want to tweak the weekly posting a touch anyway.

The first thing that came to mind was using the RSS feed but I dismissed that because it will only show a maximum of 100 items. That would probably do for my purposes but I’d like to go ahead and set up something I don’t have to worry about – did I get all the links? etc.

So I decided on a different approach. I haven’t done any of this yet. I am going to work on it while I write this.

Here is the plan:

  1. export the links as html
  2. grab out the html I need and paste it into a new post in WP
  3. post it

Simple, except for a few points.

I actually came up with this idea a few days ago and I grabbed an export then. I checked my blog and found that the latest link posted was the trash vortex page at greenpeace.org so I simply removed all links above that and saved this file as ~/delicious.htm.

Remove all html above the last posted link

Now it’s time to grab the new links for this week, so I go to del.icio.us and export the html and save it to the desktop. Then,

mv ~/delicious.htm ~/delicious-old.htm
mv ~/Desktop/del*.htm ~/delicious.htm
diff ~/delicious-old.htm ~/delicious.htm > links.diff

The only thing to do now is clean it up and post it. Let’s start by doing it manually. I’ve stripped most of the new links out for demonstration. Take a look.

Diff file

First, I remove the first three lines and the last five lines. I’ve run a few tests now and it looks as though this will always be the case. This should make automation easier. This procedure is obviously going to require a bit of manual intervention so I should be able to notice when a problem crops up.
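If the three-and-five pattern holds, that trimming could be scripted. A sketch, with a hypothetical function name; it buffers lines in awk instead of relying on a head that accepts negative counts.

```shell
#!/bin/sh
# trim_diff (hypothetical name): drop the first 3 and the last 5 lines of stdin.
trim_diff() {
  tail -n +4 |
    awk -v drop=5 '
      { buf[NR] = $0 }                                      # remember every line
      NR > drop { print buf[NR - drop]; delete buf[NR - drop] }  # emit it once 5 more have arrived
    '
}
```

Usage would be trim_diff < links.diff. If an export ever changes the number of header or footer lines, the counts need adjusting by hand.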

After removing those lines I am left with a bunch of lines like those below.

> <DL><p><DT><A HREF="http://online.wsj.com/article/SB123731266862258869.html" LAST_VISIT="1238172267" ADD_DATE="1238172267" TAGS="fun,economics,games,culture,scrabble,words">Scrabble and Other Games Have Overvalued Points - WSJ.com</A>
> <DD>Scrabble is a great game and should be left alone.

The only thing necessary to make this “work” is to remove the > at the beginning of each line, but we will make it “right” by changing uppercase tags and attributes to lowercase, closing all elements, and wrapping all of it in

<dl></dl>

Then I copy and paste it into a new post in WP and I’m all set. Requires a bit of input but not hard to do. I will see how much I can automate it on Wednesday.

This is the first in a series of posts. The next post is here.

Scraping The iTunes Store

I recently wrote a series of posts detailing the way I chose to scrape iTunes for ratings information for Bailout America, an iPhone game we released recently.

Read more at thedoedoeblog.

LeftLink

LeftLink was a collection of interesting links with re-written headlines. The project slowed to a complete halt due to the time necessary to maintain it manually.

So, it was brought back to life as an aggregation of progressive info using the simple mechanism I put together for iPhoneDeck.

GetMetsTickets.com

I set up my newest website, getmetstickets.com, as a way to help fans who might not otherwise be able to attend.

Learn more about my motivation on my blog.

AppTheater

AppTheater is a video sharing site centered around iPhone apps. You can watch thousands of iPhone related videos or upload some of your own.

We set this up as a preview of apps. Many of the videos give you a nice demo of the app for your consideration before purchase. You can also review comments and see what other users are saying about the app (or the video).

Check it out at apptheater.com and upload some videos of your own.

If you don’t see an app you’d like to preview, contact me or use the contact form on the site.

Bailout America

I served as program manager for an iPhone app which was recently released at the App Store. The game is called Bailout America (iTunes Link) and it is a real blast.

Check out the game’s homepage here.

Seems to be a hit so far. We’ve had a couple of mentions that I’ve seen online – AppCraver.com and NY Times.

Alinks Repairs

Great Programmer! Very easy to work with, good communication skills, got what I wanted, understood every problem 100%! Highly recommend for your project! ….Thanks!

alinks script repairs and mods

WordPress as a CMS

I set up a custom WordPress theme to be used as a complete CMS for this website.

Google Maps – Click to Add

I set up a map for a client who wanted to track firefly sightings in Europe.

I don’t remember all of the specs on this one but I know

  1. Users can click to add a sighting
  2. Users can upload sightings in bulk if they have lat and long data
  3. It uses the lat/long scheme used in the UK
  4. It updates live

It was built with functional and valid xhtml so that it could be placed into another site and styled accordingly. I don’t know where this ended up on the web, but you can see my fully functioning (as of 7/12/2007) version at http://mashedpotatoearth.com/fireflies/

ReallyCheckYourself

Boist did a great job on this project. It was completed quickly and accurately. He even went above and beyond to ensure it was done right.


ReallyCheckYourself

Google Maps

I created a Google Maps site which helped users locate medical testing centers. You can see it at http://www.reallycheckyourself.org/.

WordPress Theme

I finished up a custom WordPress theme for this website.

1000 Moms

I took an existing PHP website set up by a novice and worked in some real programming without disrupting what was already very comfortable for the client. You can see the site at http://www.1000moms1000dollars.com/.

Webpage Replication Service

I wrote a program to edit thousands of static webpages.

Event Tickets Center

I provided a combination of PHP, Perl and JavaScript to solve a cross domain security problem for this website. This was much more difficult than I’ve made it sound but the details are protected by NDA.

Alinks

I provided several modifications for this script for a couple of different clients.

Startup Studio

I set up and configured a WordPress powered site for this client. The work included a custom template based on the client’s original design, several plug-in modifications, and two custom plug-ins. The site is inactive now, but you can see it at http://startupstudio.com/.

Data Extraction

This client wanted a list of all YouTube videos on a certain topic. I found some of the work and posted it. Like most scraping work, it may not work anymore. It’s here if you want to check it out.

WordPress Theme

I do a lot of WordPress work – themes, plugins, core mods, etc.

This client wanted a standards compliant theme with an adsense style look. He provided an image and I built the xhtml/CSS and the WP theme.

I am not sure where you can find his use of it, but there is a slightly modified version here.

Karl Jackson

Worked hard to get my project functioning. Excellent programmer.

Perl Script Install and Customization

Script Install and Customization

Set up an off-the-shelf Perl script for automated web-based marketing and customized it for this client’s special needs.

Website Replication

A guru in every sense of the word! This guy should be given a government national asset award for services to the United States! Hire him now!

Website Replication

Amazon API

I’ve been experimenting with Amazon.com’s APIs for several projects.

You can see my work here.

Experience