February 2010 Archives

Seriously Debugging the Text Indexer Code

  • Posted on February 28, 2010 at 4:55 pm

After feeling like wading in honey during the past weeks I finally get around to squash some bugs in my text indexer code. The first one was the obligatory use of a null pointer in rare cases. I know, this should never happen. Found it, squashed it. Won’t happen again (I am pretty confident about this).

The next problem was a wrong string comparison when dealing with file extensions. Ignoring the “.” leads to match of “ps” and “props”. The latter is no PostScript® file and cannot be indexed (well, it can be, but it shouldn’t). “.” are from now on never ignored.

The test data consists of 3755 files. After filtering 648 documents remain (file extensions .doc, .htm, .html, .odp, .ods, .odt, .ps, .pdf, .php, .rtf, .txt, .xml, .xls). The results are indexed by means of the PostgreSQL text index function. The resulting database has a table size of 488 kiB (23 MiB documents, 19 MiB text index). Indexing works fairly well so far. The database should be more than sufficient for testing the front end. I’ll probably have a go at the content of the two Cryptome.org DVDs I ordered a couple of weeks ago. Both DVDs contain 42914 files in 1106 directories. The total size is over 8 GiB. Maybe I publish the front end URL to the indexed Cryptome data. Let’s see.

The Joy of High Level Languages

  • Posted on February 16, 2010 at 6:06 pm

When programming you should use a high level programming language. This is important since you do not ever again have to deal with the intricacies of the platform you are working on. Coding becomes paradise. And the Earth is flat, and pigs can fly. I’ve spent over ten days tracking down a problem of Awstats stopping to update the web statistics. The configuration was copied from the old server, as were all the logs, the previous configurations, everything. Yet Awstats did not generate new statistics.

Finally I found an unsuspecting line in the logs. It went: „Warning: Error while retrieving hashfile: Byte order is not compatible at ../../lib/Storable.pm” It’s just a warning, so it’s nothing to be worried about, right? And since we use a high level language surely the change from 32 bit to 64 bit Debian cannot make a difference, right? We code in high level, we do not deal with byte orders and other wordly stuff anymore. We are enlightened. And obviously we are fucked. Thanks to a hint on a blog somewhere the web statistics are working again.

I will continue my text indexer project today. It’s written in C++.

Tweet me alone!

  • Posted on February 9, 2010 at 11:17 am
How to use Twitter to watch over idiots.

How to use Twitter to watch over idiots.

Well, why do people leave their insides on the  doormat of every stranger that give them cookies? Please explain.

Sexuelle Übergriffe vermeiden

  • Posted on February 7, 2010 at 10:02 pm

Ich habe zufällig einen Text gefunden, der zu einem aktuellen Anlaß paßt. Es geht um sexuelle Belästigung oder Schlimmeres in diesem Zusammenhang. Es ist einer Freundin passiert und Anwesende haben nicht geholfen. Stattdessen wurde der Vorfall heruntergespielt. Das darf nicht passieren.

Special Agent Carrie Nelson, CRP coordinator, said another key factor in preventing sexual assaults is bystander intervention.

“It is simply standing up and doing the right thing. It’s intervening on your buddy’s behalf when you see him or her taking advantage of someone too intoxicated to understand what’s happening. It’s making sure your shipmate gets home safely. It’s preventing an assault before it happens,” said Nelson. “Bystander intervention is an obligation that every one of us shares. It is our duty to step up and stop someone from becoming a perpetrator or a victim.”

Some examples of bystander intervention include making up an excuse to get someone away from a risky situation, pointing out disrespectful behavior, recommending to a bartender or party host that someone has had too much to drink, trying to keep someone from going to an isolated location, and maybe even calling the police.

Nelson pointed out that the most effective intervention often involves taking action before a crime can occur. She added it is important for bystanders to always keep themselves safe, know options that are available, attempt to have other people assist, and avoid violence.Special Agent Carrie Nelson, CRP coordinator, said another key factor in preventing sexual assaults is bystander intervention.

“It is simply standing up and doing the right thing. It’s intervening on your buddy’s behalf when you see him or her taking advantage of someone too intoxicated to understand what’s happening. It’s making sure your shipmate gets home safely. It’s preventing an assault before it happens,” said Nelson. “Bystander intervention is an obligation that every one of us shares. It is our duty to step up and stop someone from becoming a perpetrator or a victim.”

Some examples of bystander intervention include making up an excuse to get someone away from a risky situation, pointing out disrespectful behavior, recommending to a bartender or party host that someone has had too much to drink, trying to keep someone from going to an isolated location, and maybe even calling the police.

Nelson pointed out that the most effective intervention often involves taking action before a crime can occur. She added it is important for bystanders to always keep themselves safe, know options that are available, attempt to have other people assist, and avoid violence.

Diese Empfehlungen sollte sich jeder zu Herzen nehmen und bei passender Gelegenheit auch umsetzen.

  • Comments are off for Sexuelle Übergriffe vermeiden
  • Tags:

Top