Wonder if CSS3 <figure><figcaption> elements would be of use to you? See for example
http://html5doctor.com/the-figure-figcaption-elements/
Cannot recall the particulars, but Dan Cederholm's "Bulletproof Web Design" had a wonderful bit using Definition Lists <dl><dd>-</dd><dt>-</dt></dl> to hang text and images together.