A peril of having a highly dynamic web siteWanderingThoughts, this blog, is built on top of DWiki, my sprawling wiki-oid program. DWiki is layered on top of a normal Unix directory hierarchy and does things by having different ways of looking at it (called 'views' in DWiki terminology); all of DWiki's support for blogs is implemented with general features that can be used on any directory. The only difference between WanderingThoughts and say the DWiki help area is that WanderingThoughts is set up to default to the 'blog' view and has some template skinning to add the sidebar. (This way I could tell myself I was just writing some small additional features for my existing program, instead of yet another blogging system.) Other things are also done as general features. The calendar and range
based blog navigation is 'virtual directories' that can be applied to
any real directory. Atom syndication feeds are just another view of a
directory hierarchy, any directory hierarchy. Because you can compose
this generality together, it's trivial to do things like get an Atom
feed of the five most recently changed pages in all of CSpace; just
tack on ' All of this is very general and dynamic (since everything is generated on the fly). And therein lies our peril, right at the intersection of all of these dynamic website features. Follow along:
Between the top level directory plus category subdirectories, times day and month and year pages for every day with posts, WanderingThoughts probably has thousands of subdirectories. Each of these directories has its own Atom syndication feed, each of which can be autodiscovered by anything that crawls CSpace through those handy links. Boy, I hope that any crawlers doing that are smart enough to realize they have a bunch of duplicate feeds. Boy am I an optimist. (DWiki can't mark the calendar navigation links 'nofollow', because I
want web spiders to follow them to find older blog entries. How else
are they going to do it? (Web spiders not infrequently shy away from
links with ' |
These are my WanderingThoughts GettingAround This is part of CSpace, and is written by ChrisSiebenmann. * * * Atom feeds are available; see the bottom of most pages. Categories: links, linux, programming, python, snark, solaris, spam, sysadmin, tech, unix, web |