Archive

Archive for the ‘Geek stuff’ Category

Sending emails with EC2 and correct reverse-dns

April 24th, 2010

A major problem for many users of EC2 has been sending emails because of issues with reverse DNS causing it to be regarded as spam many places.

I got it working just fine using the Elastic Load Balancer. I configure exim “dpkg-reconfigure exim4-config” to use my elastic load balancer as it’s Mail Server Name. For example, if I have the website example.com, I make sure the DNS for this site points to my load balancer (and in the load balancer I register the server/s hosting my site). So, when I send mails from somebody@example.com it will reverse DNS correctly.

p.s. you cannot actually point the DNS for example.com to the load balancer since the load balancer is not a IP you have to use CNAME and can only point www.example.com (or any subdomain) to the load balancer. You could point example.com to an elastic IP for one of your servers and have permanently point example.com to www.example.com.

elfar Geek stuff

Bioconductor 2.5

October 29th, 2009

Bioconductor 2.5 has been released, and many new packages are included. A fair amount of the new packages are related to analysis of data from sequencing platforms (Chip-seq and RNA-seq) and analysis of miRNA targets, just to mention some of the packages that looked interesting. See the full statement here.

Troels Geek stuff

The evolutionary history of human microRNAs in 10 lines of Ruby code

October 11th, 2009

If you have a well-structured data model and an intuitive framework for querying your data it suddenly becomes feasible (and fun) to quickly address smaller scientific questions as they arise. If you use an expressive and intuitive query language it is also a lot easier to share your query/code with people in your group or in a scientific forum. miRMaid (www.mirmaid.org) provides such a software framework for miRNA data. If you are interested in microRNAs, or in doing something similar in your own data domain, then read on.

Click to continue reading “The evolutionary history of human microRNAs in 10 lines of Ruby code”

anders Geek stuff, microRNA, resource, ruby , ,

Automatic PDF fetching of articles

October 3rd, 2009

Tired of clicking your way to the article PDFs you need? Check this out and find out how you can fully automate this process.

Click to continue reading “Automatic PDF fetching of articles”

elfar Geek stuff , , ,

Geeky weekend

October 2nd, 2009

Just happened to stumble across this very geeky post on blog updating from emacs and R. I am sure that my fellow geeks and I will be spending some time during the weekend checking this out. Maybe the readers of this blog will be spammed with R-code, even more, in the near future. Have a nice weekend.

Troels Geek stuff

Analysis of gene expression

September 27th, 2009

A small rant about two different, and very useful methods for analyzing expression data from micro arrays.

Click to continue reading “Analysis of gene expression”

Troels Algorithms, gene-regulation , ,

Power plotting with ggplot2

September 21st, 2009

The importance of good data visualization is hard to overstate. However, as soon as you start to get multiple related datasets with many different variables, making useful graphics becomes a major challenge. Excel is of-course a no go for biogeeks and usually we and most of our geeks-colleagues around the world turn to R. However, R’s basic graphics capabilities is not very well suited for organizing and making lots of plots at the same time.

Here is a pharmacology example: Say you want to make 12 dose-response plots of various compounds tested in various cell lines. With basic R this would require writing a for-loop and fidling around a lot with axis and plot labelling and the par()-function to make them fit on one page. With basic R you would have be extremely careful to make the code general and reusable for next time when you have different compounds and different cell lines.

Enter ggplot2 and the grammar of graphics. ggplot2 is a package for implementing the grammar of graphics, which allows you to write extremely succinct and natural languages like code that produces stunning visualizations. Here is some code

p = qplot(Concentration, Percent.of.control, 
               data=screening_data, 
               geom=c("point", "smooth"), colour=Response.type) +
        scale_x_log10() +
        facet_grid(Compound ~ Cell.line) +
        coord_cartesian(ylim=c(-10, 110))
print(p)

And now the plot:

ggplot2 dose response curves with facetting

If you compare the code and the plot you will realize that the code contains about the words that you would use if you were told to briefly describe the plot using English. Me like!

If you want to compare differences in potency of the compounds in different cell lines that can easily be accomplished by changing the variables:

q =  qplot(Cell.line, Percent.of.control, 
                 data=screening_data, 
                 geom=c("point"), colour=Response.type) +
        facet_grid(Compound ~ Concentration) +
        coord_cartesian(ylim=c(-10, 110))
print(q)

ggplot2 example with variables switched

In many ways it is similar to lattice, but Ggplot2 is more opinionated, e.g. it forces you to organize your data in long data.frames. Conveniently it includes several useful tools for reformating your data structures.

Read more  in the draft ggplot2-book, which freely available from the developer.

Morten Geek stuff

Making the best of a bad situation

September 13th, 2009

Next generation sequencing offers a lot of extremely interesting opportunities, but it also presents us with at least as many difficulties – especially when you are working with ancient DNA. I have already touched upon the problem of primer contamination in the data in a previous post. Here, I want to mention a few problems with PCR artifacts.

Click to continue reading “Making the best of a bad situation”

Stinus Geek stuff, introduction , , , ,

Practical RNA structure visualization with VARNA

September 6th, 2009

VARNA (Visualization Applet for RNA) (http://varna.lri.fr/) is a new tool for RNA secondary structure visualization and it has a lot going for it: open source, pretty structure rendering, export to svg/jpg/eps, interactive inspection of structure with change of layout algorithm (linear/circular/radial). And most importantly: it is so simple to use that it is actually practical! If you are a teacher/student in a class covering RNA structure you should really have a closer look at VARNA. As a researcher, it is very easy to store dynamic VARNA-enabled RNA secondary structures on your computer – and in some situations this is really what you want.

Click to continue reading “Practical RNA structure visualization with VARNA”

anders Cool Tools, Geek stuff ,

Automation with Fsniper

May 21st, 2009

Fsniper is a great, easy-to-use tool to automatically watch over your folders/files and perform a given action every time there is new file in the folder or a change is made to any of the files. For example, I develop some websites locally, but I also test them on a remote test server. To make sure my local machine and remote server are always in sync I simply have fsniper watch over my web directory and have it run rsync every time a change in this directory occurs. This is just one simple example, fsniper is actually so flexible that only your imagination sets limit to what you can do. Read more to see how to get started using fsniper…

Click to continue reading “Automation with Fsniper”

elfar Cool Tools, Geek stuff ,