|
January 01, 2009
A Crypto Problem
My #3 Son was reading Poe's The Gold Bug and so he and I were discussing solving crypto puzzles. My son mentioned the letter frequencies Poe gave and I said Poe was wrong. So I asked him if he was sure he remembered it correctly. My son said he was sure he remembered it correctly and got the book out to prove it. Of course I had read Poe many years ago, but I didn't remember the letter frequencies Poe gave. Here are the letter frequencies Poe gave in order from most common to least as: e a o i d h n r s t u y c f g l m w b k p q x z In Linotype usage the frequencies are: e t a o i n s h r d l u c m f w y p v b g k q j x z and like any good crypto puzzle solver let me put them one on top of the other: e a o i d h n r s t u y c f g l m w b k p q x z Do any of you have an explanation for the differences at least among the most common letters? Was he trying to make it harder for his readers to solve the crypto puzzles he often published? Should you care to read or re-read the story here is an on line version of The Gold Bug from Project Gutenberg. And here is a paperback version: The Gold-Bug and Other Tales. Cross Posted at Power and Control posted by Simon on 01.01.09 at 09:02 PM
Comments
Here's someone who thinks Poe is playing games: Dennis · January 1, 2009 10:59 PM I speculate that Poe didn't have access to a good frequency table and generated his own, perhaps from an atypical sample, or from too small a sample size. SBP · January 1, 2009 11:03 PM Why doesn't Poe include "j" and "v"? Anonymous · January 1, 2009 11:07 PM The frequencies he gives are from his own cryptograms, none of which decodes to "vajayjay," because it was old-timey days. guy on internet · January 2, 2009 01:21 AM SBP, You would have a point if the letters "in error" came at the end of the table. However misplacing t d and h so significantly and t especially seems intentional. It might be interesting to scan a number of texts to see if it is possible to find one with that frequency pattern. Poe may have hidden a cryptogram in his letter frequency chart. M. Simon · January 2, 2009 02:33 AM Wasn't Poe an accomplished cryptographer who claimed that no one in the world could design a cipher he couldn't crack? He apparently succeeded when he challenged readers of his newspaper column to try to stump him. It seems more likely that he gave bad information on purpose. Perhaps he didn't want to give away too much information. It might generate interest in the puzzles but not give people an absolute key to solving them. The rest of his readers, who still didn't go in for the puzzles, would be none the wiser but could still enjoy the story with its internal logic. Dennis · January 2, 2009 09:19 AM No mystery. Poe wrote 150 years ago. English word use was different then. And Poe was just one person, not the English Speaking Peoples. His table might have been right for the United States of his day. An interesting test would be a table made from Poe's writings. That would be English as he used it. Type would have still been set by hand for Poe. Mark Twain famously invested in early typesetting machines and lost a fortune. K · January 2, 2009 11:50 AM Intriguingly, this is the ranking for a book cipher which uses the first letter of each word. The ranking for first letters in English is t o a w b c d s f m r h i y e g l n o u j k whereas that for every letter in English is as you quoted. If you multiply the numerical frequencies for first letters by the numerical frequencies for every letter, the ranking will be close to that given by Poe. Alex Green · January 2, 2009 11:53 AM Alex, That is very interesting. M. Simon · January 2, 2009 12:49 PM Why doesn't Poe include "j" and "v"? Now this is interesting. Classical Latin uses j and i interchangeably, and the same for u and v. I wonder if Poe was working from a Latin text? SBP · January 2, 2009 02:00 PM The Wikipedia page doesn't give a table for Latin, but the Italian table has "t" at 5.62% and the Spanish table has "t" at 4.63%, both significantly lower than the 9.06% for English. SBP · January 2, 2009 02:08 PM Kluber gives the percentages for Latin as: I - 10.1 M - 3.4 V - 0.7 Alex Green · January 2, 2009 03:47 PM Poe's table is probably based on a pretty small sample size. Note that in Dave · January 2, 2009 08:47 PM "Son #3"? I didn't even know you had any kids! Aakash · January 6, 2009 01:42 AM Aakash, Three sons, one daughter. M. Simon · January 6, 2009 01:53 AM Post a comment
You may use basic HTML for formatting.
|
|
January 2009
WORLD-WIDE CALENDAR
Search the Site
E-mail
Classics To Go
Archives
January 2009
December 2008 November 2008 October 2008 September 2008 August 2008 July 2008 June 2008 May 2008 April 2008 March 2008 February 2008 January 2008 December 2007 November 2007 October 2007 September 2007 August 2007 July 2007 June 2007 May 2007 April 2007 March 2007 February 2007 January 2007 December 2006 November 2006 October 2006 September 2006 August 2006 July 2006 June 2006 May 2006 April 2006 March 2006 February 2006 January 2006 December 2005 November 2005 October 2005 September 2005 August 2005 July 2005 June 2005 May 2005 April 2005 March 2005 February 2005 January 2005 December 2004 November 2004 October 2004 September 2004 August 2004 July 2004 June 2004 May 2004 April 2004 March 2004 February 2004 January 2004 December 2003 November 2003 October 2003 September 2003 August 2003 July 2003 June 2003 May 2003 May 2002 AB 1634 MBAPBSAAGOP Skepticism See more archives here Old (Blogspot) archives
Recent Entries
A Few Economic Sparks
Hamas Can't Phone Home The Exterior Of The Interior Ministry The Ezekiel Project - Update Colonels Look At The Mexican Collapse The American Peole Must Make a Wise Choice A Lack Of Solidarity Now We Know Who To Blame The First Crook Resigns Operation Cast Lead
Links
Site Credits
|
|
Simon Singh's terrific "The Code Book" has a crypto problem at the end. He offered money to the first person to solve it. Someone did.