Merlin’s weekly podcast with Dan Benjamin. We talk about creativity, independence, and making things you love.
Merlin’s weekly podcast with Dan Benjamin. We talk about creativity, independence, and making things you love.
”What’s 43 Folders?”
43Folders.com is Merlin Mann’s website about finding the time and attention to do your best creative work.
Academic notes in one big text file + tag clouds
mdl | Nov 3 2006
Back when I first started writing my dissertation (seems like another lifetime now), I used FileMaker Pro to store all my notes. After hundreds of dollars and a lot of headaches, I now realize that I could easily have used the command line and text files. And I would have ended up with a much more portable, robust, and versatile data bank than anything FileMaker could offer. I present my quick and dirty note storage solution here as a command line newbie, so I would be grateful for any tips or help. Thought it might be of interest to all the grad students and academics out there. 1) I take notes on index cards. This limits me to "atomic" bits of information, thwarting my bad habit of keeping a massive stream of unbroken notes. (When I used FileMaker, some of my entries were several pages long, clearly defeating the purpose of a database). 2) When I process my inbox, I append these new notes to a growing text file. Each note occupies a single line and gets a unique date/time stamp. It also gets a bibliographic code (author-date of publication - e.g., Wallace03), which I can use to search a separate .bib file. (I save the index cards as a paper backup.) 3) I tag each note with several keywords, using "kw" to set them apart. E.g., kwrenaissance kwprinter kwstatistics kwflorence kw1490s- these might be tags for a note on the number of printers in Renaissance Florence. 4) I can now quickly search my notes using grep. If I want to look only for keywords, then I make sure to begin my search query with kw. So let's say I wanted to see my notes tagged "Beethoven." Well then I would type "grep kwbeethoven notes.txt." (I leave all my keywords uncapitalized.) Immediately all the relevant lines pop up. If I want a broader search, then I leave out the kw (and be sure to turn on case insensitive [-i]). 5) If I'm not sure whether I've used a keyword before (or if I'm lazy), I can always count on VIM's autocomplete feature. Let's say I want to enter a keyword for Roman Law but can't remember whether I've previously used kwlawroman or kwromanlaw. Well, then I simply type kwrom followed by CTRL-N or CTRL-P, which allows me to flip through all the other words beginning with kwrom in the document. I can do the same thing for kwlaw. 6) tr -cs A-za-z '\012' < notes.txt | sort | uniq -c | grep kw | sed 's/kw//' > keywords.txt And finally, this ugly little command will pull out all the words beginning with kw, strip them of that unsightly kw, count how many times they appear in my notes, and put the results in a text file. I'm sure there would be a prettier way to do this, but it works OK for me. So I get a nice little tag cloud of all my keywords together with frequency, which looks like this: 2 1830s I guess my only question now is how big my notes file can become before it gets too slow and/or unwieldy to work with. I'm thrilled to discover the simplicity of text. It's free, universal, and so much more secure than my other solutions. And with grep and redirection, it's so easy to pull together a new, smaller text file of notes on a particular subject. The nice thing is that I can always go from here to other platforms. At some point, I plan to split my big text file into single-entry files and import all these little files into DevonThink, so that I can take advantage of that program's "See Also" feature. But I like the text file. Besides, I can access my database from anywhere using SSH. 15 Comments
POSTED IN:
Forgot to mention bibliographiesSubmitted by mdl on December 11, 2006 - 2:01pm.
For bibliographical entries, the fields are as follows in my text database: source::entry::keywords::annotation or... bxBossyCW86::Bossy, John. Christianity in the West, 1400-1700. Oxford: Oxford UP, 1986::fxhistory fxreligion lxeurope txearlymodern kxchristianity kxreformation kxchristianization kxmodernization::Synthesizing essay on transformations of Christianity during early modern period. From ritual/community to doctrine/discipline. The resulting export (using a customized "awk" command) looks like this: --------------------------- BossyCW86 Bossy, John. Christianity in the West, 1400-1700. Oxford: Oxford UP, 1986. history religion europe earlymodern christianity reformation christianization modernization Synthesizing essay on transformations of Christianity during early modern period. From ritual/community to doctrine/discipline. _______________________ The neat effect: By exporting my bibliographical entries with the source listed first, DT turns the reference code into the title of each bibliographical record. If the autolink feature is turned on, I get automatic links from the source in each notes entry to the corresponding bibliographical record. Someone with more ambition could probably set up a script to copy the keywords to the comments line in DevonThink. But DevonThink's AI feature is nifty enough to make sense of these tags as simple words within the text. I like to strip away my keyword prefixes: otherwise DevonThink has trouble linking the tags with other clippings in my database. So DevonThink gets all the random ephemeralia (webclippings, email, images, etc.). For my own sanity, I like keeping my academic notes in a more controlled, structured, and portable form, so that they don't get mixed up with a lot of DT detritus. But I can now integrate my more structured text notes file with my growing DT repository. I'm working on automating the export a little more. I'll post the results of that endeavor later. Hope this helps! » POSTED IN:
|
|
EXPLORE 43Folders | THE GOOD STUFF |