Archive for March, 2011

Inside NIF: Working with Search Results

Posted on March 30th, 2011 in Inside NIF, News & Events | No Comments »

The Neuroscience Information Framework currently contains millions of unique data records. After your first search, you may be overwhelmed by the scope of the search results. This Inside NIF post provide tips and tricks for using the search results interface (shown below).

Resize the Viewing Area

You can increase the size of the viewing area by moving the area between the search query and the results.

Resizing, Sorting, Show/Hide Columns

There are several options for managing tables within the search results. To resize columns, place your mouse at the border between two columns until the resize icon appears. Then left click and drag the column to the desired size. To sort data within a column, mouse over the column header until a down arrow appears to the right of the column header. Click on the arrow and select “Sort ascending” or “Sort descending.” To turn off individual columns, perform the same procedure but navigate to the “Columns” menu below the sort options.

More Results per page

To change the number of results within a page, select a value from the list on the upper right corner of the search results window.

Export

To export results as a comma-separated value text file (csv), click on the “Export” link in the upper right corner of the search results window.

For more tips and tricks, as well as tutorial videos check out our NIF Tutorials page!


Inside NIF appears every Wednesday on the NIF Blog. Join us each Wednesday to learn more about what’s happening at NIF, your Neuroscience Information Framework.

 

NIF Data Spotlight: AddGene – What was that plasmid?

Posted on March 28th, 2011 in Data Spotlight, News & Events | No Comments »

We have all been there before. Whether learning or re-learning the basic biology of DNA plasmids, figuring out the best plasmid for your line of research can be a difficult task.  This week’s featured NIF database, AddGene, is greatly improving this process. AddGene maintains a high-quality plasmid repository which allows researchers to submit and deposit plasmids. These plasmids are then cataloged and linked to published articles so that scientists can easily find data and other information related to the plasmid of interest.

The “Brainbow” Nature article, “Transgenic strategies for combinatorial expression of fluorescent proteins in the nervous system” by Livet et. al. (2007) provides an illuminating example.  By exploiting the Cre-Lox recombination system, these researchers were able to induce stochastic expression of several fluorescent proteins across a population of neurons. As a result of the Brainbow transgenes, individual neurons and their projections become labeled with a distinct color.  This strategy, therefore, provides a novel way to visualize neuronal network architecture with precise detail and significantly improves computer-aided tracing of cellular interactions (see figure below).

Ready to add some color to your research? No problem with the help of AddGene! A quick search forBrainbow at Addgene provides all the information necessary to evaluate and purchase each of the Brainbow transgenes discussed in our article. At NIF, the AddGene database is a registered resource of the NIF data federation and search results can be found under Data Type > Plasmids. For additional information about NIF and AddGene, we have an example available at NIF Tutorials.

 

 

 


The NIF Data Spotlight is a weekly blog post highlighting the databases, information, and resources curated by the Neuroscience Information Framework. For comments, questions or concerns feel free to drop us a line at curation@neuinfo.org.

NIF Webinar – March 29, 2011 / Topic: KEfED, Knowledge Engineering from Experimental Design

Posted on March 22nd, 2011 in Webinar Announcement | No Comments »

The Neuroscience Information Framework (NIF) hosts Webinar series on topics focused on collaborating with NIF, getting involved in building the NIF vocabulary, using NIF portal resources, as well as other appropriate NIF topics.

Hello everyone,

The next NIF Webinar is scheduled for Tuesday, March 29, 2011. Please join Dr. Gully Burns from University of Southern California as he describes KEfED, Knowledge Engineering from Experimental Design. The project describes an approach to building a Knowledge Representation for scientific observations. This approach is designed to provide a lightweight representation for scientific knowledge that is (a) generalizable, (b) a suitable target for text-mining approaches, (c) relatively semantically simple, and (d) is based on the way that scientist plan experiments and should therefore be intuitively understandable to non-computational bench scientists. The team has constructed a web-application as a preliminary instantiation of this approach to demonstrate its feasibility and to gather feedback from the community.

Date and Time: Tuesday, March 29, 2011 • 11:00-12:00 PST
Topic: KEfED – Knowledge Engineering from Experimental Design
Presenter: Gully Burns
URL: http://connect.neuinfo.org/webinar
Dial-In (toll-free): 866-740-1260
Access Code: 8220739

Mark your calendars! See you there.

NIFarious Ideas: Science collects data, the Brain improvises

Posted on March 11th, 2011 in News & Events, NIFarious Ideas | No Comments »

As an enterprise, science exists to collect objective data from the natural world, a job opportunity afforded to science by the innate fallacies of the human brain, according to Dr. Neil deGrasse Tyson in a recent Point of Inquiry interview on Communicating Science. That is, the brain is particularly apt at “discovering” patterns or connections between objects when, in fact, they do not exist. As noted by Dr. Tyson, this ability is very interesting to scientists studying the brain, but can be very troublesome to scientists studying physical properties of the real world.

At the Neuroscience Information Framework, we are taking a stance somewhere in the middle. Our project aims to become the central portal for all things neuroscience on the Web, particularly in regards to data which is hidden below the horizon of traditional Internet search engines. With more than 60 registered data resources and growing, NIF now offers unmatched accessibility to publicly shared neuroscience databases. NIF integrates these resources with semantic web technologies to improve the ability of machines to perform computations across the NIF data federation. However, knowledge discovery is a process in which computers and humans must work together, supplementing each others’ abilities or, in this case, fallacies.

Computers or automated agents provide the human user with superior working memory and unbiased information processing. The human brain, on the other hand, has unique abilities to quickly adjust or adapt knowledge with new information. At NIF, we are currently investigating visual analytics platforms and other data views to support this knowledge discovery process. In the end, we seek to provide the most effective ways to organize and present this data to our users such that patterns between data can be discovered and investigated further.

 


NIFarious Ideas is a regular weekly column on the NIF Blog that appears every Friday. We seek to highlight the avant-garde, the dangerous, the progressive, the cutting edge in software tools, databasing, ontologies, searching, data collecting and distributing, and of course, neuroscience trends. Join us each Friday — Be NIFarious!

NIF Webinar – March 15, 2011 / Topic: The SWAN Annotation Tool and Annotation Ontology

Posted on March 9th, 2011 in Webinar Announcement | No Comments »

The Neuroscience Information Framework (NIF) hosts Webinar series on topics focused on collaborating with NIF, getting involved in building the NIF vocabulary, using NIF portal resources, as well as other appropriate NIF topics.

Hello everyone,

The next NIF Webinar is scheduled for Tuesday, March 15, 2011. Please join Dr. Paolo Ciccarese from Harvard University as he describes the SWAN Annotation Tool and Annotation Ontology. The SWAN Annotation Tool is a web application for producing annotation of online documents and document fragments. The tool allows to define personal annotation, public annotation and annotation that can be accessed only by a selected group of users. The SWAN Annotation Tool is developed in parallel with the Annotation Ontology (AO) an OWL vocabulary for representing and sharing annotation. AO is designed to extensively reuse existing domain ontologies (entities annotations or semantic tags) and to provide several other kind of annotations – comments, textual annotation (classic tags), notes, examples, erratum… – on potentially any kind of resource (text, images, audio, databases…) and resource fragment.

Date and Time: Tuesday, March 15, 2011 • 11:00-12:00 PST
Topic: The SWAN Annotation Tool and Annotation Ontology
Presenter: Paolo Ciccarese
URL: http://connect.neuinfo.org/webinar
Dial-In (toll-free): 866-740-1260
Access Code: 8220739

Mark your calendars! See you there.

The Factorial Problem in the Annotation of the Scientific Paper

Posted on March 2nd, 2011 in Anita Bandrowski, Essays, Force11, News & Events | No Comments »

For the better part of the last 200 years, scientists and researchers have been honing the process of publishing scientific articles by carefully citing the people and ideas leading up to their own thinking, they have refined how they speak about their own discoveries, and they have become much more prolific as a community. Indeed, the “publish or perish” mantra has led to an unprecedented proliferation of publishable material, so much so that no scientist can reasonably expect to read everything in his or her field.

We have just heard at the Beyond the PDF conference and echoed throughout the scientific community that the scientific paper should have live links to data, videos and software tools so researchers can almost, practically, recreate experiments. The scientific paper should have a provenance of scientific discourse, i.e., who said what about whom. The scientific paper should really not be called a paper at all; it should be called a publication and should never again be thought of as a flat object.

Okay, great, all these new dimensions will then become exactly that, orthogonal axes to the text, and there will be references to all these wonderful and useful things. However, authors and authoring tool builders may begin a bit of a backlash, if they have not done so already. Why? Well, let’s just consider the standard paper as using n repositories of data, consisting of 5 vendors for materials, 1 database where data are stored that the paper was based on, 3 software tool pointers at different open and closed source repositories with which the data were analyzed, running on some well-defined platform, and a list of 4 ontologies which were used to annotate the various important parts of the paper. Now, consider that the author or toolmaker would need to access these repositories of data, software tools, vendors and ontologies to pull the unique identifiers that allow text-mining systems to “read” the paper. That is fine; in this example, the total number of databases that would be accessed is only 13-16, depending on whether we want to consider system emulation as part of our parameters.

In astronomy, which runs entirely on a few very large datasets accessible by everyone in the community, this problem is solved. The community sets standards for scientific discourse, Pluto be damned! Everyone links to everyone they care to link to, and all communication issues can be boiled down to whether the astronomers are talking to other astronomers or to journalists about to spin an astrology piece pointing to some new zodiac sign.

So back to our non-astronomy science paper above, let us now consider that each of the parties involved would need to link to each of the other parties involved. For example, the mouse provider would like to say that this article discusses some new discovery based on their knockout mouse, the software tools would like to point to datasets useful for testing data, ontologies would like to capture new instances, etc. If we take this to the extreme, we are faced with a factorial (n!) connection between the various players in the scientific publication process. Now let’s consider that data can be stored in one of 2061 databases useful to neuroscience, and we see that a rather large number of the connections emerge. The problem of connectivity becomes a large tangled “hairball” from which no answer can come other than “this is too darn complicated.”

hairball image

Caption: hairball (n!) vs. one link solution (n+1)

So how do we disentangle the “hairball” and let the tools that people produce help scientists write new research publications (not flat papers and not tangled hairballs)? Here is one possible answer: Create a platform which the whole community can point to, which can route the appropriate links to the appropriate place, reducing the overhead of “everyone linking to everyone” (n!) down to a manageable “everyone linking to one” (n+1) problem?

That seems like a useful exercise, and an exercise that is emerging as a theme in the shared names, DataCite and NIF projects.