Subdirectories: NewFeatures.
DWiki's configuration file
DWiki's configuration file has a simple format. Blank lines and comments (any line that has a '
#' as the first non-whitespace character) are just skipped, and everything else is interpreted as a configuration directive to set. Directives can be continued with additional lines by starting the continued lines with whitespace (as in email headers). The continuation whitespace will be turned into a single space in the final, un-continued version of the line.Configuration directives have optional values, which are separated from the configuration item by whitespace. (Whitespace within the value is not interpreted, although trailing whitespace is removed from lines.)
So an example set of configuration file lines might be:
root /web/data/dwiki pagedir pages tmpldir templates wikiname TestWiki wikititle Testing WikiDWiki requires and uses some configuration directives. Unused configuration directives are not errors; all configuration directives (and their values) become part of the context variables available for template
${...}expansion.To simplify life, configuration directives are put through a canonicalization process. This operates like so:
- if root is specified, it must be an absolute path to a directory.
- if something ending in dir or file is not an absolute path and root is defined, DWiki sees if tacking on root results in the right sort of thing.
- any directive ending in dir or file must wind up (possibly after the root prefixing above) being an absolute path to a directory or a file respectively.
- if root is defined and pagedir, tmpldir, or rcsdir are not defined, DWiki sees if directories called
pages,templates, orrcsrootexist under root and if so sets up the configuration directives appropriately.Required configuration directives are:
pagedir,tmpldir,wikiname, androoturl. This means that with defaulting, the minimal DWiki configuration file is:root /some/where rooturl /some/thing wikiname SomeThingConfiguration directives and their meanings:
Core where-to-find-things configuration:
root- If present, this is taken as the root directory that further configuration directives can specify paths relative to.
pagedir- The root directory of the page hierarchy. (Required.)
tmpldir- The root directory of the template hierarchy. (Required; cannot be the same as
pagedir.)usercs- Support checking RCS files for things like page history, page locker, and so on. Whether or not
usercsis set, DWiki refuses to serve files ending with,vor in RCS directories; see InvalidPageNames. As a result, settingusercsis only necessary if you want page history et al to be visible to people visiting the DWiki; you can use RCS yourself on page files without setting it.rcsroot- The root directory of the separate RCS file storage hierarchy; used only if
usercsis on.
Normally, RCS files are expected to be inRCSdirectories underpagedir, where basic RCS commands put them (if you make those directories; DWiki requires you to work this way). With this directive on, the RCS,vfiles for files underpagedirare instead found under here, in a mirror of the directory structure inpagedir, so you have pagedir/foo/barand rcsdir/foo/bar,v. This keepspagedirneater at the expense of requiring some scripting support.Web configuration
wikiname- The short, one-word name of this DWiki. You probably want to have a CamelCased name. This shows up as the name of the breadcrumbs, among other places. (Required.)
wikititle- The full, multi-word title of this DWiki.
wikiroot- The front page of the DWiki; the page you get redirected to when you request the DWiki's root. If this isn't set or doesn't exist, DWiki tries
wikiname's value as a page name; if that doesn't work, people see the DWiki's root directory in a directory view.rooturl- The URL of the directory that is the root of the DWiki instance; use '/' to mean 'the root of the web server'.
publicurl- If set, DWiki puts this directory's URL on the front of DWiki URLs instead of
rooturl.staticdir- The directory to serve static files from. DWiki only serves files from this hierarchy; requests for a directory will fail.
staticurl- The URL of the directory that is the root of static files. If
staticurldoesn't start with a slash, it's taken as a subdirectory ofrooturl. (Requiresstaticdirto be set.)charset- If set, DWiki claims that all text/html and text/plain content it generates is in this character set in HTTP replies. Normally 'UTF-8' these days. If unset, DWiki does not label text/html and text/plain HTTP replies with character set information. You should set this to 'UTF-8'. Really. It shouldn't even be optional.
cssurlprefix- This is technically not a DWiki configuration directive as such because it isn't interpreted by the program. Instead it's used by the standard
html/csstemplate as one option for where to find DWiki's standard CSS file,dwiki.css. If this is set it's the URL of a directory (without a trailing slash). If this is not set, thehtml/csstemplate assumes thatdwiki.csscan be found at${staticurl}/dwiki.css. It's more efficient to servedwiki.cssoutside of DWiki itself, since it's a static file.Note that various parts of DWikiText rendering do not look right if the CSS is missing (in particular, all sorts of tables are likely to look bad).
DWiki URL to file mapping
When DWiki gets a request for a URL, it tries to turn it into a request for something under either
staticurl(if defined) orrooturl; whatever is left after subtracting the appropriate thing is the path being served relative tostaticdirorpagedir.staticurlis checked first, so it can be a subset of the URL space available underrooturl.For safety reasons, DWiki only tries to process a request if the request's URL falls under either
staticurlorrooturl. If DWiki receives a request for anything outside those two, something is clearly wrong and it generates a terse error page.When it generates URLs for DWiki pages DWiki normally puts
rooturlon front (as a directory). However, if you setpublicurlDWiki puts that on the front instead.This is useful if for internal reasons you receive requests with their URLs rewritten to something users shouldn't (or can't) use. The case ChrisSiebenmann knows is Apache with URL aliases and the DWiki CGI-BIN being run via suexec.
Authentication
See Authentication for more information on the authentication system.
authfile- Where DWiki can find user / password / group information for the DWiki's users. If this is set, the DWiki has authentication.
defaultuser- If set, all otherwise not authenticated connections get to be this user, if the user is in
authfile. This should be used carefully, as it makes all requests to the DWiki be authenticated (since they all have a user, if even only the default user). If this is set, the username it is set to is said to be the 'guest user'.global-authseed- This is a special magic token to make it harder to brute-force people's DWiki passwords in some situations. It can be any value and should be kept secret.
global-authseed-file- This is the file to read
global-authseedfrom, if it is set. The file has no special format, but should contain some randomness and its contents should be kept secret.authcookie-path- This controls the 'path=' value for the authentication cookies generated by DWiki. If not set to a value, we use the root URL; otherwise we use the value straight. If it is not set, authentication cookes have no explicit 'path=' setting. ChrisSiebenmann has come to believe that you don't want to set this, and it remains as a vestigial remnant.
logins-report-bad- If present in the configuration file, DWiki will log the username (or at least the first 50 characters of it) for bad logins with unknown usernames. This is not necessarily a good idea but at one point was interesting to track what form-stuffing spammers were doing.
Comments
commentsdir- The root directory for storing comments in. The only place DWiki writes permanent data to.
comments-on- Enable commenting in this DWiki. This requires that
commentsdirbe defined and that authentication be enabled.comments-in-normal- Your standard templates display comments on the normal view of the page instead of the 'showcomments' view.
remap-normal-to-showcomments- DWiki will remap the 'normal' view for pages to the 'showcomments' view, thereby implementing
comments-in-normalwithout you needing to change the standard templates.If you want to enable anonymous comments you should create a
guestuser in the DWikiauthfileand then setguestas thedefaultuser. (Well, you can use the username of your choice, butguestis conventional.)Caching
DWiki can optionally cache the results of page generation to speed up response time. See Caching for a longer discussion.
cachedir- The root directory for storing the caches. It should not be used for anything else (ie, not it should not also be
pagedir,tmpldir, orcommentsdir). DWiki will write scratch files to here.cache-warn-errors- Log warnings about cache store errors. (These are non-fatal but indicate that your cache isn't caching.)
render-cache- Enable caching the results of selected renderers and renderer components. (Requires
cachedirto be set.)render-heuristic-ttl- The TTL of renderer cache entries with heuristic validators, in seconds. The default value is an hour.
render-anonymous-only- Use the renderer cache only for the guest user or for connections that are not authenticated.
render-heuristic-flagged-ttl- The TTL of renderer cache entries that have explicit invalidation (aka 'flagged' cache entries), in seconds. The default value is 48 hours, as explicit invalidation is considered safer than heuristic invalidation.
render-heuristic-flagged-delta- In order to lessen the chance of races between renderer cache invalidation and renderer cache regeneration, flagged cache entries must be at least this many seconds more recent than the invalidation marker (if it exists). Defaults to 30 seconds.
bfc-cache-ttl- Enable a brute force page cache of complete pages with a TTL of this many seconds. (Requires
cachedirto be set.)bfc-time-min- A complete page will be cached if it took at least this much of a second to be generated. Defaults to 0.75 of a second.
bfc-load-min- A complete page will be cached if the load average is at least this high. No default; the BFC normally doesn't look at the load average at all.
bfc-time-triv- Regardless of the setting of
bfc-load-min, don't bother looking at the load average if the page took at most this long to generate. Defaults to 0.09 of a second.bfc-atom-ttl- Use this TTL for Atom syndication requests, instead of the normal one.
bfc-atom-nocond-ttl- Use this TTL for Atom syndication requests that are not using conditional GET, and also force the caching of the results of these requests regardless of the load.
bfc-skip-robots- If set, this is a list of User-Agent substrings (formatted as for
bad-robots, see later) for robots that should not cause entries to be put into the BFC.imc-cache-entries- Enable an in-memory cache of complete pages with this many entries. The IMC skips all pages that the BFC skips. The IMC is only meaningful if the same process handles more than one request, so by default it is only enabled if DWiki knows that it is running using
dwiki-scgi.pyas a preforking SCGI server.imc-force-on- Force the IMC on even if DWiki would not enable it. You probably only want to use this if you are running DWiki as a WSGI application inside a preforking WSGI server such as uWSGI.
imc-cache-ttl- The TTL, in seconds, of entries in the in-memory cache; must be provided if
imc-cache-entriesis.imc-resp-max-size- The maximum size (in kilobytes) of pages that will be cached in the in-memory cache. The default value is 256 KB.
slow-requests-by- Delay all requests by this much, in fractional seconds. Normally used only for testing BFC.
In practice some degree of caching is mandatory for decent performance once your DWiki gets big enough and so it's recommended that you turn on
render-cacheandbfc-cache-ttlunless you have a good reason to do otherwise. Turn onimc-cache-entriesandimc-cache-ttlif you're using SCGI.Syndication feed controls
atomfeed-display-howmany- How many items at most an Atom feed should display. If set, it must be a positive integer; if not set,
atom::pagesandatom::commentsuse a default of 100 items.feed-max-size- How many kilobytes
atom::pagesoratom::commentsshould try to limit their output to. If set, either stops adding new entries (regardless of how many entries have been processed already) once they have generated that many kilobytes or more of output. Because of the 'or more' clause, you should allow for a safety margin. If unset, syndication feeds are not size-limited.feed-max-size-ips- If set, this is a whitespace separated list of IPv4 addresses, tcpwrappers style IPv4 address prefixes (eg '
66.150.15.'), or IPv4 CIDRs (eg '66.150.15.0/25') thatfeed-max-sizeapplies to. Syndication requests from any other addresses are not size-limited. If unset,feed-max-sizeapplies to all syndication requests, regardless of what IP address makes the request. This option can be specified multiple times; if so, all the addresses are merged together.feed-start-time- If set, pages older than this time will not appear in Atom feeds, which is handy if you want to move a DWiki, redirect the old URLs, and not flood people's Atom feeds (because the Atom <id> for pages is the page's full URL unless you've set
atomfeed-tag). The value can be specified either as an integer Unix timestamp, as 'YEAR-MO-DA [HH:MM[:SS]]', 'YEAR/MO/DA', or an Atom format time string, and is always in local time (even when specified as an Atom format time string; sorry).atomfeed-tag- If set, the
atom::pagetagrenderer will use it to generate Atom <id>s for pages in the format <tag>:/<page path>. This should normally be set to atag:-based URI; see here for a discussion.atomfeed-tag-time- If set, the
atom::pagetagrenderer will only generate tag-formatted Atom <id>s for pages more recent than this time. This can be used to make a graceful transition into tag-based Atom <id>s for an existing DWiki (and then, withfeed-start-time, to graceful move it). This has the same time format asfeed-start-time.
atomfeed-virt-only-adv- If set, restrict what Atom page feeds are advertised for virtual directories. If we are displaying a vdir and it is not a listed type, we advertise the Atom feed for the real directory instead (eg, for 'blog/2007/10/' the Atom feed advertised would be for 'blog/'). This is a space-separated list of vdir types; the allowed types are
latest,oldest,range,calendar, and thecalendarsubtypesyear,month, andday.atomfeed-virt-only-in- If set, restrict what virtual directories allow Atom page feed requests. A disallowed
latestorrangefeed request is (permanently) redirected to the real directory's feed; other disallowed feeds get 404 responses. The format and list of vdir types is the same as foratomfeed-virt-only-adv. If this is set, it becomesatomfeed-virt-only-in's default value. If both are set, this should be a superset ofatomfeed-virt-only-adv's value; otherwise DWiki will advertise feeds that it will refuse requests for.You should normally allow feeds for
latestbecause this gives people a way of controlling how large a feed they pull from you; they can use, eg, 'blog/latest/10/?atom' to pull only a ten-entry feed instead of your full-sized feed.These two directives don't change or affect what Atom comment feeds are advertised or allowed; they affect only Atom feeds for pages.
Other features:
alias-path- This sets the DWiki path for the third place to try to find CamelCase links in (see Formatting). This allows a DWiki to have a collection of CamelCase names for things that are globally usable but that don't clutter up the DWiki root directory.
This is a DWiki path, not a filesystem path (and is implicitly always an absolute DWiki path). The conventional value isAliases.search-on- enables searching. If it has the value 'authenticated', only authenticated users can search. Note that if you have a guest user set, all users are authenticated.
blog-display-howmany- How many items the
blog::blogrenderer should try to restrict most pages it displays to. If set, it must be a positive integer; if not set,blog::bloguses a default.
canon-hosts- If set, this is a space-separated list of canonical hostnames for this DWiki. If a request has a
Host:header that is not in this list, DWiki immediately serves up a redirection to the first hostname in the list (orcanon-host-url, if that is set), which is assumed to be the preferred hostname.canon-host-url- If set, this is the canonical URL for the host of this DWiki (without the ending
/, but including http or https and the port if necessary). DWiki will generate redirects and absolute URLs that use this URL. Ifcanon-hostsis also set, this should be the full version of the first entry incanon-hosts.(This is primarily useful in some hopefully unusual situations involving HTTP-to-HTTPS transitions.)
literal-words- If set, this is a list of strings, separated by '
|' (space, |, space), that will be rendered literally and not considered to contain markup, as if each of them had been specified in '.pn lit <whatever>' processing note directives.Special oddities
dump-req-times- Report the amount of time that requests took to standard error. This is set by the standard
-Toption.dump-atom-reqs- Report on Atom requests to standard error. This is set by the standard
-Aoption.stamp-messages- Add timestamp and client IP address to messages reported by the above two options. This is set by the standard
--stampoption.These are documented because you might want to set them directly if you're running DWiki as a WSGI application inside some standard WSGI server (such as uWSGI, Apache's mod_wsgi, or gUnicorn).
Dealing with bad clients:
bad-robots- If set, this is a list of User-Agent substrings, separated by '
|' (space, |, space), for robots that should get permission denied responses when they try to fetch pages in various views that no robot should be fetching. Currently the list of bad views is atom, atomcomments, source, and writecomment, all of which are typically fetched by robots that don't respectrel="nofollow"on links.no-ua-is-bad-robot- If set, any request with a missing User-Agent header is considered to be from a bad robot.
banned-robots- If set, this is a list of User-Agent substrings (formatted as for
bad-robots) for robots that should get permission denied responses on all requests.banned-ips- If set, this is a list of IPv4 addresses, tcpwrapper style IP prefixes, or CIDRs (as for
feed-max-size-ips) for addresses that will get access denied responses for all requests. It can be specified multiple times.banned-comment-ips- If set, this is like
banned-ipsbut only applies to attempts to write comments.bad-robot-ips- If set, this is like
banned-ipsbut only applies to requests that try to fetch pages in various views that no robot should be fetching (as inbad-robots).Under normal circumstances it's more efficient to use your web server's access controls to totally ban IP addresses and bad user-agents; your web server usually has faster code for this and you don't have to get DWiki involved in the process.
banned-robotsandbanned-ipsexist because this is not always possible.
What templates DWiki uses
Per ProcessingModel, DWiki ultimately produces output by expanding a template. This means that DWiki has to figure out what template to use for this process, and because the TemplateSyntax is fairly limited, it is much simpler for DWiki to start with a separate template for every different view of things it wants to have.
This means that while DWiki tries not to hardcode template names or the structure of the template directory, there are a certain amount of hardcoded names it knows about that need to be there for proper DWiki operation.
The short list of such templates is:
dwiki/view-*.tmpl,dwiki.tmpl: starting view templates.views/*: conventional location for templates that display a particular ordinary view.error.tmpl,errors/*: displaying errors (always 404 responses).login-error.tmpl: displaying a login error (a regular page, not a 404).- Comment templates:
comment/comment.tmpl: used to show each comment when we're showing all comments.comment/posting.tmpl: used to show the result of posting a comment. By convention,comment/posted-<result>.tmplis used to display specific results, where<result>is one of 'good' (the comment was posted successfully), 'bad' (something went wrong), 'badchars' (the comment has bad characters in it), or 'nocomment' (the comment was empty and DWiki refused to post it).blog/blogdirpage.tmpl: used to show each page in BlogDir view.blog/blogentry.tmpl: used to show each page in Blog view.syndication/atomentry.tmpl: used to render an Atom feed entry for each page.syndication/atomcomment.tmpl: used to render an Atom feed entry for each comment.syndication/rss2entry.tmpl: used to render an RSS 2.0 feed entry for each page.All paths are relative to the template directory.
Determining a template for a view
For views that are displayed using templates, DWiki tries to find the starting template by looking in three places, in order:
dwiki/view-<view>-<pagetype>.tmpldwiki/view-<view>.tmpldwiki.tmplBy convention, everything that generates text/html pages just goes through
dwiki.tmplso that there is one place that does top-level 'skinning' for the entire DWiki. Only views that both use templates and generate something besides text/html sidestep this.The standard
dwiki.tmpluses the#{<...}first-found template inclusion mechanism (see TemplateSyntax) to pull in the real per-view content. It looks in four places to try to find this content, in this order:
Overrides/...$(page)/$(view-format).tmplOverrides/...$(page)/all.tmplviews/$(view-format)-$(pagetype).tmplviews/$(view-format).tmplThe first two allow page and directory hierarchy specific overrides; the latter two are the generic places. Most views don't need to distinguish between file types, but the 'normal' view must use different templates for files and directories (since a directory doesn't have wikitext to display).
The current template-based views are: normal, history, search, blog, blogdir, atom, atomcomments, sitemap, showcomments, and writecomment. The login and logout views are 'synthetic' and don't actually display anything unless an error happens. The 'source' view simply dumps the page content out straight without getting anywhere near templates.
Note that the atom and atomcomments views are special: although they render through templates, they generate application/atom+xml content instead of text/html. Thus they use
dwiki/view-*templates directly, bypassingdwiki.tmpl. The sitemap view is similarly special, although it generates application/xml content.Error templates
Errors are rendered by the template
error.tmpl. There are special error rendererserror::titleanderror::bodythat look for error-specific additional templates in the subdirectoryerrors/. Each type of error looks for titles aserrors/<error>-title.tmpland main error body aserrors/<error>.tmpl(with internal defaults if they don't exist).Current error types: badaccess, badformat, badpage, inconsistpage, nopage.
Everything else is free and floating
That's it. DWiki has no other hardcoded template names.
DWiki's caching system
DWiki has optional caching in order to speed up generating results repeatedly. DWiki uses a disk-based cache for this (although the interface is abstracted and alternate forms of caching may be introduced someday). There are three caches, which can be enabled separately: the renderer cache, a brute force page cache, and an in-memory brute force page cache that is only used if DWiki is running as a preforking SCGI server.
DWiki never removes the files of out of date cache entries from the disk cache; instead, it stops considering out of date ones to be valid. Cleaning out the detritus is left for an external process. ChrisSiebenmann considers this safer; giving a program an automated
unlink()makes him nervous.See ConfigurationFile for the options controlling the behavior of the caches.
In theory DWiki's caching is optional. In practice a decent sized DWiki is simply too slow without caching for some of the more expensive operations and caching becomes more or less a necessity. ChrisSiebenmann now believes that you should configure all levels of caching in basically any DWiki unless you have some unusual need and are sure.
The brute force cache
The brute force page cache is about as simple as you can get: it caches complete requests for a configured time (called a time-to-live, or TTL). That's it. The BFC is intended as a load-shedding measure when DWiki is under significant load, so it only acts under certain circumstances:
- only on
GETorHEADrequests.- only on requests without a
Cookie:header.- requests only get put into the cache if the system seems loaded.
(For speed, when something is valid in the cache DWiki just serves it without checking the system load.)
A good BFC TTL is on the order of 30 seconds to three minutes or so; long enough to shed significant load if you are getting a lot of hits to a few pages and short enough that dynamic pages won't become too outdated. (And that waiting to see a comment show up or whatever is not too annoying.)
Because Atom syndication requests are among the most expensive pages to compute, the BFC can be set to give them a longer TTL than usual. There is a second TTL that can be set for Atom requests that aren't using conditional
GET; the idea is that if requesters cannot be bothered to be polite, we can't be bothered to serve fresh content. Setting this option always caches the results of such requests, even if the load is low, which means that even people doing proper conditional GET requests will use the cached results for as long as their (lower) TTL says to.It's actually faster to serve static pages from the static page server code than from the BFC, so the BFC doesn't try to cache static pages.
The two sides of the BFC
It's important to understand that the BFC does not check load when it is checking to see if something is in its cache. This means there are two stages to processing a request: deciding what TTL to use for cache checks, and deciding whether to cache something that was not current in the cache.
The TTL used is:
bfc-atom-nocond-ttlif this is an unconditional request for an Atom view, if set.bfc-atom-ttlfor Atom view requests in general, if set.bfc-cache-ttlotherwise.Pages enter the BFC cache either because the system seems to be loaded or because
bfc-atom-nocond-ttlwas set and they were an unconditional request for an Atom view.Once something is in the cache, it will be served from the cache if it is not older than the check TTL. Different requests can use different check TTLs for the same cached page; for example, conditional GETs versus other requests for Atom views.
The in-memory cache
The in-memory cache is essentially a version of the brute force cache that holds pages in memory instead of on disk. It's only effective in environments where DWiki serves multiple requests from the same process; currently it's only used if DWiki is running as a preforking SCGI server. Because it holds pages in memory as page response objects, the in-memory cache is about the fastest way that DWiki can serve requests. In particular it's faster to serve static pages from the IMC than from disk, so unlike the BFC the IMC does cache static pages.
Because IMC entries disappear automatically and are essentially free to create, the IMC caches pages unconditionally when active (unlike the BFC). This means that it should normally have a relatively low TTL, often lower than the BFC's TTL. Note that because the IMC is before the BFC, it can load its cache from BFC cache hits.
For obvious reasons, it's pointless to set the IMC cache size to be larger than the number of requests a preforked SCGI process will serve before exiting.
To keep IMC memory usage under control, the IMC has a settable maximum page size that it will cache. Tune this as appropriate for your environment.
The IMC can be deliberately forced on with
imc-force-on, in case you're running DWiki in some other preforking environment (for example as a WSGI application under a preforking WSGI server such as uWSGI).Considerations for the IMC TTL
Under some setups, DWiki will only be running as a (preforking) SCGI server when it's under heavy load; in others DWiki is running this way all of the time, even when the load is light. Because the IMC unconditionally caches pages the latter situation can be annoying; it means that someone who, say, writes and posts a new comment may not see that comment until the IMC TTL expires. DWiki makes some attempt to bypass the IMC (and the BFC) in the common case of someone leaving a comment. However this is not perfect (in part because it requires the web browser to accept cookies from DWiki).
(This also applies if you're running DWiki in some other preforking environment and have forced the IMC on.)
If you're running DWiki full time in an IMC-on environment, you likely want to set a quite low IMC TTL, such as 15 to 30 seconds. If you're running DWiki with the IMC on only under heavy load you can set a higher IMC TTL, such as two minutes (120 seconds).
The renderer cache
The renderer cache is actually two caches. The renderer cache proper caches the output of various renderers (cf TemplateSyntax). The output is cached with a validator and the cached results are fully validated before they get used; this means that renderer cache entries do not normally use a TTL and in theory could be valid for years.
The (heuristic) generator cache caches the output of some expensive precursor generator routines. These cache entries only have heuristic validators, where DWiki can be fooled if people try hard enough. Generator cache entries do have a TTL, so that if the heuristic is fooled DWiki will pick up the new result sooner or later. Some cache entries can also explicitly invalidated by DWiki in a pretty reliable process; by default, these have a much longer TTL than plain heuristic cache entries. These are called 'flagged' (heuristic) generator entries and various ConfigurationFile settings controlling how they behave are
render-heuristic-flagged-....(Trivia: the 'flagged' name is because such entries are invalidated using a flag file, or more accurately a flag cache entry.)
Currently the main renderer cache caches the output of various wikitext to HTML rendering routines while the generator cache caches the results of various filesystem 'find all descendents' walks that are used to build lists of comments (for Atom comments feeds and some wikitext macros; this uses explicit invalidation) and lists of pages (for Atom feeds and various blog renderers such as
blog::prevnext).Unfortunately, a DWiki page that has comment or access restrictions must be cached separately for each DWiki user that views it. Under some situations this can result in a number of identical copies being cached under different names. If you want to avoid this, DWiki lets you turn off renderer caching for non-anonymous users.
Force-invalidating list of pages caches
The general validator for
blog::prevnextcache entries is the modification time for all of the directories involved that had files in them at the time (the latter condition is for technical reasons). The heuristical validator checks that some of the file timestamps are still the same, but it can't check all of them and still be a useful cache.So the easy way to invalidate this is to change the modification time of a directory involved, for example with
touch.The 'list of pages' cache is similarly invalidated by changing a directory modification time. Unlike the
blog::prevnextcase, the directory times are the only thing that this cache checks. This is a bit of a pity but the performance improvements from caching this information are very visible.Disk space usage and directories
Much like comments, each page that has something cached for it becomes a subdirectory, with the various cached things in files. The different sorts of caches use different top-level directories under the
cachedir, so you have paths likecachedir/bfc,cachedir/renderers, andcachedir/generators.Because some results include absolute URLs that mention the current hostname, DWiki must maintain separate caches for each
Host:header it sees in the BFC and the general renderers cache. These are handled as subdirectories in each cache directory, socachedir/bfc/localhost/...and so on. Entries in the generator cache don't depend on the currentHost:header, so there is only one (sub)cache for all requests,cachedir/generators/all/....Generally the general renderers cache uses the largest amount of disk space, followed by the BFC, and the generator cache is the smallest.
If you're using caching (and as mentioned, you probably want to), you'll want to periodically trim the caches. ChrisSiebenmann just does this by hand every so often by removing the cache directories entirely; DWiki will then rebuild them as necessary.