Getting rained out in a brainstorm

In a meeting yesterday, somebody on the IRC shared links to “Re-identification of home addresses from spatial locations anonymized by Gaussian skew” and “Bregman divergences in the (m x k)-partitioning problem“, from Science Digest. Sez the abstract of the latter,

A method of fixed cardinality partition is examined. This methodology can be applied on many problems, such as the confidentiality protection, in which the protection of confidential information has to be ensured, while preserving the information content of the data. The basic feature of the technique is to aggregate the data into m groups of small fixed size k, by minimizing Bregman divergences. It is shown that, in the case of non-uniform probability measures the groups of the optimal solution are not necessarily separated by hyperplanes, while with uniform they are. After the creation of an initial partition on a real data-set, an algorithm, based on two different Bregman divergences, is proposed and applied. This methodology provides us with a very fast and efficient tool to construct a near-optimum partition for the (m×k)-partitioning problem.

Keywords: Confidentiality; Data masking; Fixed cardinality partitioning; Fixed size micro-aggregation; Bregman divergences; Pythagorean property; Convex partition

What’s extra wacky is that I actually spent time diving into this stuff, even though it’s about forty thousand leagues over my head. Still, it was fun trying to remember all that math I barely learned too long ago.

As I recall, the highest grade I ever got in high school math was a C. That was in Geometry. (Hey, I’m a visual guy.) The only math course I took in college was Statistics. The teacher and I couldn’t stand each other, and I dropped out, or thought I did. Turns out I was too late doing that and the guy gave me an F.

But I kept the book, which served me well years later when I was studying Arbitron’s ratings for radio stations. To my surprise, I actually liked the subject, and used what I learned from the book to develop algorithms for factoring out seasonal variations in station AQH (average quarter hour) shares, to aid in predicting which stations would do what in the next “book”. In addition to racking up billable hours for my company, and helping our client station sell advertising, I was able to win bets with friends in the radio business.

The biggest bet of all was that WFXC, the station with the weakest signal in the Raleigh-Durham metro, would kick ass in the first book after its programming went “urban” (that’s radio talk for “black”). The math was easy. The market was about 40% black, and no other FM stations addressed that population.

I won. Foxy was #1 in its first book. (And it’s still doing well, 2+ decades later.)

As it happens, WFXC “Foxy 107” (a name I suggested to the owners before they picked the call letters, though I don’t know if I was the first to come up with that) was consulted at the time by Dean Landsman, whom I didn’t know at the time. We became good friends years later when we both haunted the late Compuserve’s late Broadcast Professionals Forum, which was run by Mary Lu Wehmeier, now a friend as well. She was the “Sysop” for that forum, where I occasionally came off the bench to help. Running the Sysop Forum was Jonathan Zittrain, who later helped found the Berkman Center, and now stars as a professor at Harvard Law School. Making things even more circular, Dean is now a valuable and diligent contributor to ProjectVRM. Dean, a closet math whiz, made a living for many years doing in-depth work around radio station ratings. I’ll be he knows, or could puzzle out, the quoted text at the top of this post.

By the way, my nickname is the fossil remnant of a radio persona called “Doctor Dave”, featured on WDBS, the prior incarnation of WFXC, which is still around (now with a somewhat better transmitter, and a second and much larger signal on another channel, covering the east side of the market). When I was there, in the mid-’70s, WDBS was owned by Duke University and had awful ratings to go with its awful signal. But it was a great little station. Still friends with folks from those days too.

Ah, I found the picture I was looking for, now at the top of this post. That was the WDBS staff in 1975, I’m guessing. I’m the guy with the wide tie and the narrow shoulders in the back row. There are many missing folks too. I’d love to follow this digressive path, but have too much work to do. At least I’ve left plenty of link and tag bait. 🙂

1. Awesome Doc on the Foxy thing! You certainly go incredibly deep and broad in the radio medium…

BTW the particular math abstract you cite is relevant to my day job, which involves map and location content data management and the data products and services that result. Partitioning large (and typically spatial, but not always) data sets and ensuring confidentiality of (let’s call it) user shared location data keeps coming up over and over. So I’m going to share this…

I did pretty well in college math, but that’s years and years ago (I’m close to your age cohort) and have some trouble understanding abstracts like this too, but a long careful slog can sometimes be rewarded, especially if/when the writer takes the effort to tie an (ahem) “executive summary” to the actual mathematics as they are being exposed. In my experience, some do, and some don’t.

2. Doctor Dave,

I have followed and share your interest in content delivered to internet consumers bypassing traditional content silos as the net infrastructure moves forward.

You have a great understanding of the traditional broadcasting model and have written many pieces on the subject from the past, and your vision of decentralized broadcasting going forward.

Some of these articles are here, some are at the Linux Journal.com, and some are in the printed version of the Linux Journal.

Do you have a index of the wealth of different articles you have written on broadcasting?

Thanks for all of what you do.

3. Hey, John.

I haven’t been addressed as Doctor Dave in thirty years (except by a few Olde Ones who knew me back in the century).

There is no collection, far as I know. I mean, I usually look up a word string on Google and hope for the best.

I would love to have somebody gather up just my Linux Journal pieces and make a book of them. But it’s a low priority for pretty much everybody, including myself, hate to say. Looking forward and all that.

4. Doc,

We should speak.

I made a note along the way about a country music article from the Linux Journal, do you remember it?

So, I Googled “Doc Searls Linux Journal” and found the “Giant Zero” YouTube that I started watching last year…I will watch it through.

I think I have a Linux Journal CD around here somewhere and a big pile of LJ Magazines. I will look there next.

ideas…

5. Hey Doc,

I see a Progressive Insurance ad on TV , the series with the white background and the cute brunette, where the Progressive brunette provides a hand held scanner to the insurance customer guy so he can “tell” Progressive how much he wants to pay for the boxed insurance they provide.

The cute brunette says “power to the people” !

Is this a step toward your VRM model ?

6. That photo is so 70s! I had friends who looked like that but we were a bit south of Raleigh/Durham (north Florida).

Keep up the good work w LJ.