News

You are currently browsing the archive for the News category.

The meeting is open to in-warranty Sorcerer customers and by invitation only. Pre-registration is required. A light buffet and refreshments are being provided, and there will be a drawing for customer door prizes. We will have the brand-new, ultra-cool Apple iPad as our door prize! (But make sure you come on time for the best chance to win!)

We are privileged to have Profs John Yates (Scripps) and Steven Gygi (Harvard) confirmed to give a talk. We will also have training talks on the new SEQUEST 3G and the new VersaSearch technology on the SORCERER platform.

If you wish to receive a meeting invitation, please contact: tnowak@sagenresearch.com.
Seating will be limited, so reserve your spot today!

Date: Sunday 23rd May 2010
Time: 1:30 PM to 5:00 PM
Address: Hotel Monaco, 15 West 200 South, Salt Lake City, UT 84101 (801) 595-0000
Room: Suite Paris A

Hope to see many of you there!


What exactly is SEQUEST 3G? What can it do for me?
SEQUEST 3G is the latest, next-generation SEQUEST standard specifically developed and optimized for translational proteomic applications involving phosphorylation and other post-translational modifications (PTMs). Defined by Sage-N Research in close collaboration with Dr. John R. Yates, III of the Scripps Research Institute, SEQUEST 3G defines a single common standard for similarity scores, search parameters and statistics, and input/output file formats and is ideal for noisy and poor quality spectra. This robust proteomics search engine maintains compatibility with existing TPP and Scaffold workflows, and incorporates new features of the latest technologies such as electron-transfer dissociation (ETD) and high mass accuracy instruments. SEQUEST 3G also supports multiple scoring and rescoring algorithms such as ASCORE. In addition to improved sensitivity identification for phosphopeptides and low-abundance spectra, the SEQUEST 3G is a significant update that enables future complex functionality and addresses prior version incompatibilities and calculation variances.

How do I obtain SEQUEST 3G?
SEQUEST 3G is being released as an embedded component within the SORCERER v4.0 platform being released in Q1 of 2010. Whereas earlier versions (such as SORCERER-SEQUEST) relied upon a single pass, the new v4.0 platform uses a more efficient “multiple-pass” search engine architecture involving a first-pass with SEQUEST 3G to keep the top 50 candidates and then subsequent rescoring passes with other search engine modules. Current customers will receive an automatic Q1 update as part of their standard maintenance package. This update is conveniently backward-compatible in two ways: the subsequent pass is optional, and can be combined with other rescoring functions in a MUSE scripting environment.

SEQUEST 3G is available for licensing within third-party bioinformatics software suites for a variety of mass spectrometers and technologies. A SEQUEST 3G press release can be found atwww.sagenresearch.com/news_11.html

Read the rest of this entry »

We have developed an new flow for processing Thermo RAW files that works both with the most recent XCalibur V2.1, as well as with earlier versions. This flow has been giving good results in internal testing, and we are now releasing it for beta testing to any interested, actively supported Sorcerer customer.

Thermo LTQ Velos users will have noticed the major changes to the XCalibur software that were introduced at version 2.1. The installation process is different, and requires a new component called Thermo Foundation, and some of the file names and locations have changed. All of these changes are no longer compatible with the ReAdW program that is used within the CrossOver environment by Sorcerer. One workaround which has been commonly suggested in the Thermo field is to down-rev the XCalibur used on the instrument to V2.0 and to continue using the old software for analysis. This remains a viable option, but with our newly developed solution, it is now also possible to use 2.1 RAW files on Sorcerer.

We are moving to a new spectrum extractor called msconvert (part of the ProteoWizard suite)  which works with a different version of the Thermo libraries, and for which we have developed a new integration in the CrossOver environment. We are offering this as a beta release to our in-warranty customers. This solution  entails a few Linux operations to reinstall CrossOver with the latest release, to configure the required libraries and to install a new Sorcerer workflow script; it is fairly straightforward for people comfortable with the Linux environment, or alternatively, we can do it for you if you give us remote access to your system. Please contact us at support@sagenresearch.com for more information.


Prof. Alexey Nesvizhskii (left) of University of Michigan receives a thank-you gift from David Chiang after his talk.

If you really want to understand how peptide and protein identification is done, this video talk is a must-see!

Professor Alexey Nesvizhskii of the University of Michigan is one of the co-inventors (with Dr. Andy Keller) of the popular PeptideProphet/ProteinProphet algorithm for turning search engine results into statistically consistent peptide and protein identifications. (This algorithm is also the basis for the popular Scaffold software.)

At the “Translational Proteomics 2.0″ meeting, we were privileged to have Alexey give his insightful talk that reviews the various steps involved in inferring peptide and protein identifications from large spectra datasets.

In this talk, you will learn why False Discovery Rates are preferred over P-values, why you probably should not run more than 4 replicates of a MudPIT experiment, how FDR estimations from decoy differ from Peptide/ProteinProphet, how “The Two Prophets” compute probabilities by curve-fitting the score distributions, how sensitivity and FDR are computed, and the what and why of some advanced TPP options.

The talk is available at: http://www.scivee.tv/node/12671 (45 minutes).

I recommend using the “full screen” mode so you can view the slides, which are also available as a download from the site. (Please be aware that the slideset order is different from that in the presentation.)

(Note: Both Trans-Proteomic Pipeline and Scaffold Batch software are integrated into the SORCERER platforms.)


“Translational Proteomics 2.0″ 2009 Users Meeting in Philadelphia.
Guest speakers Jimmy Eng (UWashington), Alexey Nesvizhskii (UMichigan), Josh Elias (Stanford), along with SAB member John Yates (Scripps) are in the middle row.


Stanford’s Dr. Chris Adams (left) must be feeling pretty lucky!
He gets to use a SORCERER 2 for his research (as part of Allis Chien’s mass spec core facility), AND wins an Acer One netbook door prize from David Chiang!

Translational proteomics — aka Proteomics 2.0 — is high-sensitivity proteomics for translational research, whose mastery is your key to unimaginable fame and fortune in biology and medicine!

Whether you need to catch up or to keep up, you need to hear the leading proteomics technologists reveal their secrets!

We were fortunate to have three of most accomplished technologists (Mr. Jimmy Eng, Prof Josh Elias, and Prof Alexey Nesvizhskii) at our “Translational Proteomics 2.0 Meeting” give their insider insights on high-sensitivity data analysis.

In addition, we were privileged to have Sage-N Research SAB advisor Prof John Yates, one of the fathers of proteomics, attend our meeting and join in our lively panel discussions regarding the present and future of translational proteomics.

From the talks, these are tips for best sensitivity and specificity:

* There are several equivalent ways to calculate precursor mass, all of which can result in several AMUs of mass error due to incorrect isotope assignment.
* Semi-tryptic settings for database searching gives the best performance
* Use a wider mass tolerance than your experiments will yield
* However, you don’t need a wide mass tolerance for searching if (a) you use isotope shift check and (b) you have a decent source of noisy peptide, e.g. with semi-enzyme search
* Post-process peptide IDs with proper statistical tools (e.g. PeptideProphet, DTASelect or target-decoy analysis)
* Key is to monitor the false discovery rates (FDR) with different filtering criteria
* Use monoisotopic mass for fragment ions, and for precursor ions if using high-resolution instrument
* P-values or E-values are not good for large-scale proteomics, because they don’t give you estimated data rates for a given score cut-off, and they ignore other relevant factors (e.g. retention time, mass accuracy, etc.)
* The target-decoy method is a simple and effective means of FDR estimation. It gives scores more discriminatory power by improving signal-to-noise ratio.
* Can use search scores in combination with other characteristics to get more good IDs at a particular FDR than by using score alone

We will be publishing the meeting talks online. Watch this space for details!

Hear Khatereh discuss her work and her success with the SORCERER 2 system!

Dr. Khatereh Motamedchaboki is currently the Manager of the Proteomics Facility at the Burnham Institute for Medical Research.

She is one of our increasing number of two-time SORCERER success stories, as a previous user at the Ebrahim Zandi Lab at the University of Southern California.

Reference: Laurence M. Brill, Khatereh Motamedchabokia, Shuangding Wu, and Dieter A. Wolf, “Comprehensive proteomic analysis of Schizosaccharomyces pombe by two-dimensional HPLC-tandem mass spectrometry”, Methods (2009), doi:10.1016/j.ymeth.2009.02.023.

Click Here to See Video

Our R&D team is busy working on the next major version of the Sorcerer-PE software, and expects to release it to then-in-warranty customers in the next few weeks.  Early previews and beta tests of some of the components will be made available by arrangement to qualified customer sites.

Highlights of the upcoming release include:

  • ETD fragmentation support and analysis
  • MUSE scripting modules for rescoring peptide matches with Olsen-Mann and Sadygov-Coon scores
  • Interoperation with major components of the Yates lab Sequest suite, including the DTASelect filtering and statistical analysis tool, and the Census quantitation application
  • Enhancements to the SEQUEST engine which provide first-pass cross-correlation scoring and E-values for greater accuracy and sensitivity

Read the rest of this entry »

Three of the world’s leading experts on MS-MS protein identification came together recently at Sage-N Research’s annual user group meeting, and presented methods and results for the techniques and tools with which they are associated:

  • Jimmy Eng, co-inventor of Sequest and developer of many proteomics tools, presented tips for Sequest analysis
  • Josh Elias, who pioneered the systematic use of decoy databases for FDR estimation, gave a talk on how to use that technique to address Peptide ID signal-to-noise.
  • Alexey Nesvizhskii spoke about the tools he co-authored, in “Peptide identification and protein inference using PeptideProphet and ProteinProphet”

Their talks were very wide-ranging and full of practical insights for the proteomics user community, and they explored the different research interests, data sets, analysis methods and workflows in the individual labs.  However, they all had this in common: they had kept a careful eye on their search settings, monitored sensitivity and error rates, and come to a common, if perhaps not entirely intuitive, conclusion: the most sensitive search and the lowest error rates for shotgun proteomics are achieved when using semi-enzymatic searches — that is, when one end, but not both, of the peptide is allowed to diverge from the expected cleavage site.

Read the rest of this entry »

Sage-N Research is hosting its annual users’ meeting on the afternoon of Sunday, 31st May, immediately before the ASMS meeting in Philadelphia. We are proud to announce a compelling  agenda with talks from the principal developers of several of the key proteomics data analysis methods that are used as standard in the community, including SEQUEST, target-decoy search strategies, and Peptide/Protein Prophet.  The insights our clients will take away from this meeting will be very relevant to their use of Sorcerer, and promise to enhance their proteomics analysis productivity greatly.

2009 Users’ Meeting Arrangements

The meeting is open to in-warranty Sorcerer customers and by invitation only. Pre-registration is required. A light buffet and refreshments are being provided, and there will be a drawing for customer door prizes. Attending this meeting is your chance to win one of three Acer Aspire One netbook computers that we are giving away to our customers (must pre-register and be present to win)!Acer Aspire One Netbook

Date: Sunday 31st May 2009
Time:
1:30 PM to 5:00 PM
Address: Courtyard Marriott Hotel, 21 N. Juniper St, Philadelphia.
Room: Ballroom Level 1

Agenda

1:30 PM   Welcome and introductory remarks

1:45 PM    “What’s new in Sorcerer”
James Candlin, Sage-N Research, Inc.

2:05 PM    “Sequest analysis tips”
Jimmy Eng, University of Washington

2:45 PM    “Using target-decoy searching to visualize peptide identification signal-to-noise”
Dr. Joshua Elias, Stanford University School of Medicine

3:15 PM    Break and refreshments

3:50 PM   “Peptide identification and protein inference using PeptideProphet and ProteinProphet”
Dr. Alexey Nesvizhskii, University of Michigan

4:30 PM   Panel Discussion: “Putting it together: strategies for a productive proteomics analysis workflow”

4:50 PM   Concluding remarks

5:00 PM   End of meeting

APEX (’Absolute Proteomics Expression’) is a technique developed by Lu et al. for label-free quantitation of proteins based on MS-MS spectral counting of peptides. Unlike basic methods of this sort which suffer from variable detection probabilities that depend on the physiochemical properties of the peptides, APEX includes correction factors that predict the detection rates of the peptides for a better protein quantitation result.

There is an open source APEX Quantitative Proteomics Tool that implements this technique and that can use Sequest-based protein IDs as analyzed by the Trans-Proteomic Pipeline. Sorcerer users had the idea of using the tool in conjunction with Sorcerer, and now we have developed a workflow and MUSE script to help other users use this combination.

For more information, please read the application note ‘Sorcerer Workflow for the APEX Quantitative Proteomics Tool’.

We are pleased to announce the availability of the ISIS (Integrated Storage and Information System), which is configured and integrated to work directly with the SORCERER Enterprise bladecenter system to provide 4 to 100+ terabytes of integrated, protected storage for proteomics, genomics, imaging, and other repository needs. A second backup ISIS system can be configured offsite to provide additional backup and disaster recovery needs. To simplify maintenance and warranty for our clients, it will be covered under the same warranty plan as the SORCERER system for 3 years or 5 years.

The base ISIS system will provide approximately 4.1 terabytes of secure storage in a “2U” height, rack-mount system, consisting of twelve 450 GB SAS disks with 2 disk redundancy in RAID6.

In most countries, the ISIS system consists of the following:

- ISIS storage integration software interface running on SORCERER platform
- Fujitsu ETERNUS DX80 with single controller
- Approximately 4.1TB usable (12 x 1TB SATA disks in Raid6) per 2U rack, with up to 20 racks
- Min 3 year warranty is included (subject to the TSP coverage of the SORCERER)

Note that future expansion to 100+ TB will require additional ISIS expansion units or higher density SAS drives.

New clients can order the SORCERER Enterprise blade system with the ISIS system together as two rack-mount units. Clients with newer SORCERER 2 integrated data appliances with at least 8 CPU cores can simply add the ISIS to their existing system. (Older SORCERER systems will require a hardware upgrade.)

Please contact sales@SageNResearch.com for more information.

We are lining up an exciting program for our SORCERER clients on Sunday, May 31, 2009, from 1:30pm to 5:00pm, in Philadelphia, just before the ASMS reception:
http://www.asms.org/Default.aspx?tabid=209

Like last year, we will have plenty of toys and give-aways. It will continue to be a closed meeting, open to in-warranty clients and special invited guests only. Pre-registration will be required due to space limitations.

Please stay tuned for more details.

MSQuant, from the Centre for Experimental Bioinformatics (DK), is a leading tool used by the Matthias Mann group and others for quantitative proteomics, and in particular, SILAC analysis. It is a Windows program that is designed to take MS-MS raw files and protein IDs in the form of a Mascot Peptide Summary Report. So up until now, if you wanted to use MSQuant, the only practical way of doing it was to have Mascot installed and run it first.

Now, however, there another option: MSQuant users can use a Sequest/TPP-based toolchain for protein ID, and using a conversion utility, they can transform the ProtXML/PepXML files from TPP into a format which MSQuant can load.  Using Sorcerer’s scripting environment, MUSE, the transform can be done automatically as post-processing of a Sorcerer search. A further advantage of doing it this way is that the Sequest/TPP toolchain needs no special preparation of the input peaklist files to extract all the information that MSQuant requires for links to the scans in the raw file.

Read the rest of this entry »

Article on Sage-N Research and Thermo Fisher Scientific collaboration:

http://www.drugdiscoverynews.com/index.php?newsarticle=2475

by David.Chiang@SageNResearch.com

Orbitraps and other fast ion trap mass spectrometers (e.g. FT, LTQ) are popular instruments for discovery proteomics research.

The SEQUEST cross-correlation score is almost tailor-made for the spectral characteristics of ion trap data, whose information-rich spectra are challenging due to multiply-charged ions reported with relatively low fragment mass accuracy. This is especially important for analyzing noisy spectra that arise from low-abundance peptides and phosphorylated peptides, where the information content is embedded in the abundant small peaks.

However, you may be unaware how the basic SEQUEST functionality has evolved from the first ’sequest27′ prototype program to the latest SORCERER-SEQUEST implementation. 

Software continues to evolve to adapt to new requirements. Like a home remodeling job that never ends, at some point it becomes more practical to start over from scratch. After all, maintenance costs are several times higher than the initial development costs over the life of a software product.

The recommended architecture for high-throughput analysis is a client-server system architecture, which separates the interactive user client computer from the heavy-duty number-crunching server. This simplifies the sharing, updating, and backup of the central server, and isolates it from viruses and other sources of system instability from the user accessible client PCs.

Sequest27

Proteomic search engines were first invented by John Yates and Jimmy Eng at the University of Washington in the early 1990’s, based on the novel idea that a peptide sequence can be inferred not just from the tandem mass spectrum alone (i.e. de novo sequencing), but using known protein sequences as a reference.

The prototype search engine software was a standalone program named ’sequest27′ comprising approximately 3000 lines of C code. The source code has since been separately maintained by the Yates Lab and by Thermo, with PTM searches and other modifications added later. 

The ’sequest27′ program processes one mass spectrum at a time, and searches a protein sequence database from the beginning to end each time it is run. For example, to analyze a MudPIT experiment with 8,000 spectra, the ’sequest27′ program is run exactly 8,000 times to generate 8,000 output files, with no attempt to use information from one ’sequest27′ run to another. 

SEQUEST Cluster

The simplest way to scale up the throughput is to run the same program on many computers at once, such as in a Beowulf cluster architecture (http://www.beowulf.org/). 

The SEQUEST Cluster (”SC”) product once marketed by ThermoFinnigan uses this approach, with typically 4 to 32 Linux slave node computers running ’sequest27′ under the control of the Windows master node computer running Bioworks. 

The SC architecture partitions the set of input spectra into smaller sets for each node, and uses the master node to aggregate the results. While this approach is simpler to implement than partitioning the protein sequences, it requires each local disk to contain the same protein files, resulting in inefficient disk usage (i.e. a 16-node cluster searching the NCBI nr file must store 16 identical copies). As well, it makes the indexed search capability impractical. If the local files are large, then manually copying the files across the network to each node will take a lot of time.

To proteomics researchers new to clusters, the SC architecture seems to offer two benefits: (1) higher throughput than a single computer, and (2) ability to expand throughput in the future by adding nodes. 

However, the devil is in the details. In practice, the cluster may not offer higher throughput than an optimized, non-cluster architecture. As well, future expansion for this software architecture is impractical in light ofMoore’s Law

Depending on the search conditions, one high-end server (say with 8 GB RAM, 1.6 terabyte disk) with an optimized software architecture can outrun a 16-node cluster, whereby each slave node has 1/16th the resources (i.e. 512 MB RAM and 100 GB disk). And it will be simpler to maintain, easier to program, and approximately 16x more reliable. The partitioned RAM and disk resources make system-wide optimization difficult.

Future expansion is also impractical beyond the first year for the SC architecture, since all the slave nodes are assumed to have identical specs. With Moore’s Law predicting 2x performance increase every 18 months at the same price, it is more effective to replace the computing hardware every 2 to 3 years with a brand-new system rather than to try to buy older nodes to add to an old cluster.

Server vs. PC

Servers are not just big Personal Computers (PCs). Quality server hardware is designed for reliable 24/7 multi-processing and continuous disk access, unlike PC hardware designed for the cost-sensitive consumer market.

Robust server operating systems like Enterprise Linux are designed to simultaneously run dozens of independent programs in multi-user environments and to isolate crashed programs from affecting our programs.

Server programs have fewer restrictions than PC programs designed for easy installation and use by non-experts. Therefore, they can incorporate powerful server modules like Perl, PHP, Ruby on Rails, Apache, and MySQL, but require IT expertise for installation and configuration. 

One important benefit of the server platform is ease of integration, which is increasingly important as the workflow evolves from just the search engine to a full proteomic workflow. 

In contrast, integration can be very complex on the standard Windows operating system. For example, some mass spec software from different vendors cannot co-exist on the same Windows PC. In general, PC software is easy to install but difficult to integrate, while server software tends to be the opposite.

SORCERER-SEQUEST

The SORCERER software architecture was developed from the ground up as a server platform for high-throughput search engines and workflows, with focus on robustness, scripting flexibility, and scalable performance. 

The SORCERER platform is not hard-coded for SEQUEST, but instead is a general-purpose proteomics search platform that uses the scoring subsystem for algorithm customization. (It was initially prototyped with X!Tandem, and later introduced with SEQUEST.)

At the heart of the SORCERER software architecture is the micro-partitioning of a search job into self-contained “micro-jobs” that are distributed and managed by a relational database.

In order to further reduce search time, the protein sequences are re-arranged into a peptide-centric data structure when they are first loaded into the SORCERER and “prepared” for peptide searches. Specifically, protein sequences are pre-digested in silico into unmodified peptides, which are sorted by mass, and partitioned into 0.5 GB chunks call ’seqblobs’.

When a large search job is submitted to the SORCERER, it is added to the queue by the queuing subsystem. The Sorcerer PE Application Layer subsystem partitions each search job into possibly thousands of self-contained micro-jobs, each containing 300 spectra with associated seqblobs. With PTM searches, the same spectra unit may be search against different seqblobs with different mass ranges. (For example, a spectrum with 1000 amu precursor mass may have its unmodified peptide sequence be 1000 amu with no mods, or 920 amu with a single phospho-site.)

All the micro-jobs are recorded in a MySQL relational database. Available CPU cores from either the master or slave nodes will query the database for the next micro-job, and submit the results when completed. 

Since each seqblob contains pre-searched peptide information, each micro-job performs only the scoring function, which is the only part customized to SEQUEST or other search engines. (Before the advent of multi-core CPUs, FPGA subsystems were also used to execute search micro-jobs. Other exotic architectures, such as Nvidia GPUs and the upcoming Intel Larrabee, are also compatible and may be implemented depending on market needs.)

When all the micro-jobs associated with one queue search job is done, the results are aggregated and written out to the file subsystem. As well, an optional MUSE script is run at this time on the output directory. For example, Ascore phospho-site localization can be done with the search results, or additional re-scoring using different user-defined search engines. 

This powerful mechanism also allows algorithm developers to use the SORCERER search as a pre-search function to enrich the peptide candidates to perhaps the top 50 or 500, and then use MUSE scripting to rapidly develop scoring functions to increase accuracy. In particular, algorithm developers can optimize the important scoring functions without needing to first develop the base software to read FASTA files, compute PTM combinations, or perform other necessary but low-value operations.

Applications include the analysis of CID+ETD spectra, whereby the top CID search results are used to drive the ETD search, and MS2/MS3 phosphorylation analysis, whereby associated MS3 spectra may be separately searched in MUSE and re-combined with the MS2 results.

The SORCERER architecture includes a ‘custom’ directory, which has a higher priority than the application directory, to allow knowledgeable developers to substitute and overwrite almost any part of the SORCERER platform. (By confining all customization to this directory, it is simple to revert back to the original factory state.) Therefore, researchers can start with a powerful, functional workflow using a standard SORCERER product, then customize it as needed from simple MUSE scripts to a full re-architecting of major subsystems.

Discovery proteomics research, such as for biomarker discovery, requires advanced “Proteomics 2.0″ analyses for PTMs like phosphorylation, ETD, and quantitation in addition to high-throughput.

With the transfer of the high-throughput SEQUEST Cluster business, the choice for high-throughput data analysis is simplified to one of two SORCERER products, both of which bring powerful “Proteomics 2.0″ capabilities with the integrated MUSE scripting environment.

Many advanced proteomics analyses require some level of customization, so the MUSE scripting can be invaluable. For example, some PTMs of interest occur only on certain residues at a peptide terminus, which can be implemented as a post-search filtering step. Workflow automation, such as the compression and copying of results after search completion, can be easily scripted in MUSE. Indeed, the Ascore phospho-site localization algorithm is scripted entirely within MUSE.

Algorithm developers can quickly experiment with new scoring functions, such as for ETD, PTMs, quantitation, or even replicating other common peptide search engines, by simply re-scoring, say, the top 50 candidate peptides from a Sorcerer search. 

SEQUEST Cluster users who have developed custom interface modules to their workflow can most likely adapt their infrastructure to SORCERER with little or no change.

The SORCERER 2 system will be the product of choice for most high-throughput users. It is a plug-and-play, pre-configured Enterprise Linuxserver. Users can install it in minutes, and immediately use a web browser interface (with a password) from any network PC for uploading and downloading data and submitting search jobs. They will also appreciate the reliability, as many Sorcerer systems in the field have been continuously running for more than a year without downtime.

The SORCERER Enterprise software will be a better fit for high-throughput users who must run software on approved servers within a data center, such as in biopharmaceutical companies or large centralized labs. It can be viewed as an “a la carte” version of the software architecture within the SORCERER 2 IDA, and allows other software to co-exist on the same server. 

The SORCERER Enterprise software can be purchased pre-installed and tested on customer-specified servers. Otherwise, it and its dependent components must be installed and configured by qualified IT staff on qualified powerful servers. As well, the semi-custom nature of the installation and maintenance will result in higher support costs.

Like the SEQUEST Cluster, the SORCERER Enterprise product allows throughput to be increased with additional slave nodes running the SORCERER Enterprise Plus software. Note, however, that each high-performance slave node may be worth 16 nodes of SEQUEST Cluster under common search conditions, so you won’t need as many.

Furthermore, the combination of Thermo Discoverer and Sage-N Research SORCERER provides a powerful, customizable, client-server data analysis platform. Discoverer provides a Windows user interface customizable using the Windows .NET environment, while the SORCERER provides the back-end Enterprise Linux server with MUSE customizability.

See the joint press release at:http://www.sagenresearch.com/news_10.html

If you plan to buy a new Orbitrap or other fast mass spectrometer for discovery proteomics, we would strongly recommend that you include a SORCERER 2 system (or SORCERER Enterprise software if you must run in a data center) in your budget. PC software will not be able to keep up with a frequently used Orbitrap. 

If you have a SEQUEST Cluster that is over 2 years old, we recommend that you upgrade to SORCERER within one year to replace the outdated hardware. And please inquire about the special time-limited upgrade offer to make this transition easier.

We were privileged to have talks by Drs. John Yates (Scripps), Roman Zubarev (Uppsala),Alexander Ivanov (Harvard), Sean Beausoleil (Harvard Med), and Aaron Klammer (U. Washington) on large-scale quantitative analysis, ETD, and phosphorylation (and other PTMs). 

These talks offer a glimpse into the upcoming capabilities of the Sorcerer 2 platform. 


“Proteomics 2.0″ Users Meeting Group Picture


Dr. John Yates, at right, is presented by David Chiang with a Nintendo Wii, which was given to all 5 speakers this year.


Dr. Nick Morrice (U. Dundee) won one of the three Wii door prizes. Other lucky winners were Drs. Lynn Spruce (Childrens Hospital of Philadelphia) and Patrick Everley (Harvard Med, now in US Army). Eight Wii systems were given away in all.

Find out why all the participants were so excited about this special meeting by listening to the talks! Stay tuned to this space for how to download the talks, as well as for a chance to attend our meeting next year. (The 2009 “Proteomics 2.0″ Users Meeting will again be by invitation only.)

The following are a few of the comments from the meeting participants.


“Sage-N is in tune with the constantly evolving needs of proteomics labs throughout the world.”
Sean Beausoleil (Harvard Medical School)

“I find the Sorcerer convenient to use and when I dig carefully into selected data, I feel confident in the large-scale results from Sorcerer.”
Larry Brill (Burnham Institute)

“It was one of the best, if not the best event of this year’s ASMS! Fantastic speakers with talks about highly relevant subjects. I enjoyed it enormously. Thank you again.”
Markus Brosch (Wellcome Trust Sanger Institute, UK)

“Great opportunity to learn from field pioneers in an intimate setting.”
Josh Elias (Harvard Medical School)

“The scope of the meeting is excellent!”
Alexander Ivanov (Harvard School of Public Health)

“Great talks overall. Great prizes! Couldn’t imagine them any better.”
Aaron Klammer (University of Washington)

“Sage-N is enabling the forefront of proteomics research by providing data appliances that are robust, cutting edge, and easy to use.”
Mark Pitman (Geneva Bioinformatics SA)

“I thought you lined up a great list of speakers on useful current topics of interest. We found that Sage-N’s Sorcerer 2 product is a key component at our facility’s informatics system. The meeting was really worth my time! I was going to hop around, but I stayed on.”
Alexander B. Schilling (University of Illinois Chicago)

“A-list speakers - well worth the time.”
Lynn Spruce (Children’s Hospital of Philadelphia)

“It is a very nice small environment for people learn about the company and products.”
Ru Wei (Pfizer)

“Sorcerer provides rapid protein identification with minimal IT support requirement. It is a highly efficient tool for proteomics studies.”
Wenhong Zhu (Burnham Institute)

Sage-N Research will host a mini workshop focusing on analyses of phosphorylation, large dataset, and ECD/ETD as discussed by top experts. This meeting will be invaluable for gaining insider insights into advanced proteomics technology, as well as provide a glimpse into the future direction of the SORCERER platform.

Speakers include two Biemann Medal winners: Dr. John Yates and Dr. Roman Zubarev, as well as Ascore co-inventor Dr. Sean Beausoleil, and large dataset experts Dr. Michael MacCoss and Dr. Alexander Ivanov.

Please come to the meeting by 1:30pm to be eligible to win one of three hard-to-obtain Nintendo Wii game consoles! (This meeting is open to in-warranty Sorcerer clients and invited guests only.)

Announcing Sage-N Research Users Meeting 2008

“Proteomics 2.0: Putting the Pieces Together”
Sunday, June 1, 2008
Denver Marriott City Center
1701 California St., Downtown Denver (303-297-1300)
“Mattie Silks” Room, Lower Level 1
1:30pm to 5:00pm

Preliminary Agenda

1:30 Welcome Remarks
David Chiang (Sage-N Research)

1:40 Plenary Talk 
Dr. John R. Yates, III (Scripps Research Institute)

2:15 “Linking Proteomics to Systems Biology” 
Dr. Alexander Ivanov (Harvard School of Public Health)

2:45 Break

3:00 “ASCORE for Large-Scale Phosphorylation Site Localization” 
Dr. Sean Beausoleil (Gygi Lab, Harvard Med)

3:30 “Proteomic Profiling using Mass Spectrometry: A delicate balance between swimming and drowning in data”
Dr. Michael MacCoss and Dr. Aaron Klammer (University of Washington)

4:00 Break

4:15 “CAD/CID and ECD/ETD: Solo or duet?”
Dr. Roman Zubarev (Uppsala University)

4:45 Closing Remarks

5:00 Meeting End

This “Proteomics 2.0″ meeting is hosted for the benefit of our in-warranty SORCERER clients, whose generous support makes this meeting and continuing product improvements possible. Recent purchasers of Scaffold software from Sage-N Research are also eligible to attend.

Attendance is by invitation only and must be confirmed prior to the meeting. Space is very limited by room size to less than 30 clients and special guests. 

Please reserve your space soon, by emailing your intention to attend to: “usermeeting@SageNResearch.com“. Confirmations will be sent by return email.

Note: If you are currently not an in-warranty SORCERER client, you can also be eligible to attend the meeting if you purchase a Scaffold PC software license from Sage-N Research before the meeting.

There are a number of good presentations available for download from the National Cancer Institute’s 15 Nov 2007 workshop on “Strategies for Improving Reliability in Protein/Peptide Identification”:

http://proteomics.cancer.gov/news_events/meetings_events/agenda_2007_11_15.asp

Ron Beavis (U. British Columbia) gave the plenary lecture “Proteomics 2.0 - better, faster, cheaper” which nicely captures some of the challenges forward.

Other speakers include Karl Clauser (Broad), David Tabb (Vanderbilt), Zhongqi Zhang (Amgen), Henry Lam (ISB), Alexey Nesvizhskii (U. Michigan), Lewis Geer (NCBI), Sean Seymour (ABI), Brian Searle (Proteome Software), and Phil Andrews (U. Michigan).

Analyzing millions of spectra, quantitation by spectral counting and improved filtering — these are just a few of the new features in the just-released version 2 of the Scaffold software. Currently supported Sage-N customers who are users of the stand-alone Scaffold program can update to the new software immediately, and users of the Scaffold Batch program that is bundled with Sorcerer can look forward to these benefits in the integrated Scaffold analysis shortly.

(Scaffold Batch, which is the Scaffold program that is integrated with Sorcerer, is an enhanced, scriptable version of the more familiar Scaffold desktop, GUI-based program. It provides many of the same capabilities, but it is designed for automated use with no need for manual intervention when loading and analyzing the data. It is supplied as a premium product configuration that also comes with the standard desktop Scaffold program.)

With the current version 1.7 software, a Scaffold analysis is practically limited to about 500,000 spectra. That sounds like a lot, and it’s certainly enough even for moderately large MudPIT analysis, but Sorcerer itself is capable of searching millions of spectra, as some larger experiments require. So we at Sage-N Research are excited about the new version of Scaffold, V2, which can scale to these sorts of experiments. The method involves ‘thinning’ the spectral data so that poor quality data, which does not contribute to the statistics of the results analysis, is excluded from Scaffold. In future Sorcerer software releases, we expect to provide this feature for automated Scaffold Batch analysis, but for now, it can be applied manually when importing data into Scaffold interactively.

The Scaffold file format for version 2 has changed, but the program recognizes the old format and can convert it. So a user can generate a file using the existing version on Sorcerer, and then can open it interactively in the new version to use the new features. One potentially useful feature is the ability to merge Scaffold files. For Sorcerer users who have asked about comparing the results of multiple Sequest searches in one Scaffold analysis, this offers a method for doing it after searches have run. Other new features that may be especially useful to Sorcerer users include peptide filters by precursor mass variance and by post-translational modification, and Receiver-Operator Curves for peptide false positives.

To use Scaffold V2, Customers will need to download new software from http://www.proteomesoftware.com/Proteome_software_prod_Scaffold_download-main.html and they will need a new license key, which is available from Sage-N Research for those customers under support. Purchasers of standard Scaffold for individual computers may upgrade immediately. Upgrade license keys for Scaffold Batch are also available to current customers upon request, but as Sage-N is currently testing the interoperation of that software with Sorcerer and has already uncovered certain issues, we recommend that most users wait until the validation is complete.