I've been a faithful spamassassin user for a long
time. Never thought much about how it worked, just been happy.
But a lot of spam is leaking through the last month, so I went looking
for a tuneup.
I tried my new spamassassin setup on 594 emails my old
spamassassin setup said were not spam. The new setup correctly
identified 342 as spam and 247 as non-spam. It identified 5 messages as
non-spam when in fact they were spam, and a reassuring 0
messages as spam which were not spam. This is all excellent!
Of the 342 newly-found spam, 303 were caught by the Bayesian filter, 172 by Pyzor, and 160+ by Razor. Update 2004-01-16: I made a boneheaded mistake in this evaluation. I trained the Bayesian filter on some data, then tested it on the same data. D'oh. The reality is the Bayesian filter is still much better than without, but not quite as stellar.
After surveying
BitTorrent
download traffic for Matrix: Revolutions last month I thought I'd
check out download traffic for a movie that
didn't suck.
![]() 10,000 people grabbing a 2 gig file is a lot of video on demand. Still, crappy quality. The one sample I saw had decent sound and the picture didn't shake but the colour was all washed out. The cinematography in ROTK is so beautiful $9 is a bargain. ![]() Folks have found various ways to install custom software on it. There are two basic routes: a full Linux install or adding homebrew programs to the native OS (a stripped down Win2k). Of the homebrew programs XBox media center has gotten the most attention. It's a multimedia suite that plays movies, music, photos, etc from the local hard drive, from DVD, or streamed from a server. XPort is also impressive. One individual has ported about 20 different emulators from Windows to the XBox. Gameboy, Nintendo, Playstation, Atari 2600, Intellivision, Apple ][, etc etc, all running neatly. Quite an achievment. Many folks have speculated that Microsoft's plan with XBox is to slowly move Windows into the living room. The existing XBox hardware is already sufficient to do this, it's just the software isn't readily available yet. The hacker scene is about a year ahead. ![]() I'm still trying to fix my account after November's customer service meltdown. For the last six weeks customer care has "been experiencing heavy call volume" with "wait times longer than 20 minutes". The worst thing is the damned ads they play at you while you're on hold. As seen on Howard Forums
I bought the hype that 802.11g is 54 Megabits/second.
I paid extra for expensive 802.11g gear.
I had this stupid idea that 802.11g's 54Mbps was close enough to
100Mbps ethernet that I didn't need to run wires in my house.
It says so right on the box of my Linksys
WET54g
and in the product
sheet: "Wireless-G (54 Mbps)".
This is false advertising. The fastest 802.11g will go is 20Mbps, not 54Mbps. And in a mixed 802.11b/g network the fastest two 802.11g devices can go is 14Mbps. I verified this with a WET54g sitting right next to a WAP54g. Throughput on an FTP? 12.8Mbps. And this is best-case, quiet network with devices right next to each other. In a real deployment I get 30% packet loss. Between speed, security concerns, and general flakiness wireless is really not a reasonable option for regularly copying 2 gig files around. Good thing Amazon has a generous return policy. At least I get my $150 back.
One of the Bizarro-world realities of today is
that the same White House folks who conduct the war on Iraq were
sucking up to Hussein as an ally 20 years ago. Two items on
this. First, a lovely
Mike Luckovich cartoon
Second, a story in today's NYT:
Rumsfeld
Made Iraq Overture in '84 Despite Chemical Raids.
As a special envoy for the Reagan administration in 1984, Donald H.
Rumsfeld, now the defense secretary, traveled to Iraq to persuade
officials there that the United States was eager to improve ties with
President Saddam Hussein despite his use of chemical weapons ...
"The Iraqi leadership was extremely pleased with Amb. Rumsfeld's visit," the memo said. "Tariq Aziz had gone out of his way to praise Rumsfeld as a person." Dec 20 was the 20th anniversary of Rumsfeld and Hussein's handshake. More: National Security Archive. ![]()
The music industry's attempts to force digital data to behave like
physical objects has had two profound effects, neither of them about
music. The first is the progressive development of decentralized
network models, loosely bundled together under the rubric of
peer-to-peer. ... And the second effect, of course, is the long-predicted and
oft-delayed spread of encryption.
The cypherpunks movement is a
very powerful set of ideas. But they all slammed into the wall of
consumer indifference. I think Clay overstates the case a bit, but I
agree with him that the RIAA is driving crypto.
The other place that the RIAA is setting the cypherpunk vision in motion is their own DRM technologies. Watermarks, locked media, Palladium: it's like a cypherpunks wet dream. Only it's a nightmare: the cryptokeys are in the hands of just a few people.
Jon Carroll has an insightful and amusing
column
on the idea that Howard Dean is a wild-eyed liberal.
So what Howard Dean said is not radical or remarkable or
innovative. I mean, he's an interesting guy, and I might even find
myself voting for him, but he's not Roosevelt or anything. He just
hasn't signed on to the Official Bush/Cheney/Wolfowitz worldview,
which makes him a suspicious character indeed.
![]() The detentions of hundreds of people at Guantánamo is pretty bad. The Bush Administration has been arguing that since the detainees are in Cuba, they have no rights under US or, presumably, any law. Nice! Fortunately the Ninth Circuit said that was ridiculous since the US runs the camp in Guantánamo. While Guantánamo is bad, the case of Jose Padilla is horrible. Here we have a US citizen, arrested in America, and the government has been claiming he has no rights. Secret detention by the US government: no lawyer, no charges, nothing. It's absolutely outrageous, and finally a court said so. Even if the man is guilty of all he's charged with, that's no excuse. Americans have died defending our freedoms for over two hundred years. Bush's Justice Department seems happy to trample all over that, the courts are finally responding. ![]() Watch Paris and Nicole show up an hour late to milk the cows and spill the milk everywhere. That's OK, they'll just 'work' at the fast food place tomorrow! Watch the well-meaning Sonic manager try to train the rich girls for a wage slave job. Watch the rich girls ridicule the job behind the manager's back. That's OK, they'll just 'work' somewhere else tomorrow, and after the show is over they'll go back to being rich. After 'the girls' are back in their vapid lives the farmer is still going to be working his ass off trying to make a living with dairy cows and the fast food manager is still going to be working as hard as she can at $7 an hour to make ends meet. And that's going to be the rest of their lives. Rather than sympathizing with the hard and honourable realities of being lower middle class in rural America, the show turns it all into a cheap disrespectful joke.
The media is
stoking the fear
that this year's flu season is going
to be worse than ever. Remember 1918?
Lovely bit of alarmist infographics. Does the line continue to go up?
Is it reporting bias or a real trend?
My friend Marc points out that in this infographic the flu is Republican. ![]()
Of Halliburton's $2.64/gal, $1.17 is the price they pay in Kuwait, $1.21 is the cost of Halliburton transport, and $.26 is Halliburton's explicit markup. This is just a tiny example of the cost of oil and defense companies owning the White House. The Houston Chronicle has Halliburton's story.
I'm with
Steve: HTTP already has plenty of ways to handle caching, don't
invent something
new for RSS/Atom aggregators. If they just follow Mark's
rules (handy tests and
instructions), life
will be fine.
I worked hard to help HTTP caching on my blog. It's complicated, particularly with the pastiche of dynamic content I have. Used to be 40% of my weblog requests were answered with a bandwidth-saving 304. When I added my linkblog it went down to 25%, probably both because the HTML view changes more often and because I removed ETags support. Most aggregators do fine. Radio Userland is having trouble since I turned off ETags. And NetNewsWire has a surprisingly low number of 304s, although a quick inspection doesn't show anything obvious. ![]()
Recognizing distinctions such as the "territories" helps show the
human face of a population that is now — to most San Franciscans —
both extremely familiar and painfully foreign at the same time.
From Car Nation to the Heroin Zone to the Service / Crack Zone, it's
all there. Fodor's 2004 should include this info.
Thanks to Impacket I now
have a bit of fascinating news: most of my blog readers have an MTU of
1500 bytes. The Maximum Transfer Unit is the size of a TCP packet. You
want this to be as big as possible. 1500 is
generally the limit on the Internet (it's the Ethernet limit), but
smaller sizes may be better depending on your net connection.
![]()
# Print out sizes of IP packets
import pcapy, impacket, impacket.ImpactDecoder decoder = impacket.ImpactDecoder.EthDecoder() # packets = pcapy.open_live("eth0", 1500, 0, 100) packets = pcapy.open_offline('/tmp/cap/capture') packets.setfilter('ip') for i in xrange(100): (header, data) = packets.next() eth = decoder.decode(data) ip = eth.child() print ip.get_ip_len() It's brand new. The docs are nearly nonexistent and the library isn't as Pythonic as one would hope. But it works pretty well! Compare also scapy (less libpcap-like). PS: I ran into a problem installing on Debian
ImportError: /usr/lib/python2.3/site-packages/pcapy.so: undefined symbol: __gxx_personality_v0
The workaround was to link the .so with g++ instead of gcc.
This is either a bug in gcc or Python distutils.
|