UCSF banner
Property Subsets
Ready-to-download subsets of ZINC filtered by physical properties are available below. To go to the download page, click on the subset name. To browse sample compounds, click on the number of compounds. Three versions of subsets are available:
  • Standard subsets, numbers 1-10 (for delivery in 1-10 weeks)
  • "clean" subsets, numbers 11-20, in which stricter filtering rules have been applied
  • "immediate availability subsets", numbers 21-30 (for delivery in 7-14 days max)
More about ZINC subsets is here. You can also create your own minisubset. ZINC may be used free of charge for research by individuals and institutions. Whereas you are free to share the results of a ZINC search or a screen of molecules from ZINC, you may not redistribute major portions of ZINC without the express written permission of John Irwin.
Subset
Click to download
and Description
Compounds
Click to browse
Last UpdateSelection criteriaOnly one sourceT<.9T<.8T<.7T<.6
 lead-like
(#1)

Teague, Davis, Leeson, Oprea, Angew Chem Int Ed Engl. 1999 Dec 16;38(24):3743-3748.

34724612010-05-20p.xlogp < 3.5 and p.mwt < 350 and p.rb <= 71490795 2158281247713528910083
 fragment-like
(#2)

Carr RA, Congreve M, Murray CW, Rees DC, Drug Discov Today. 2005 Jul 15;10(14):987

4067302010-05-19p.xlogp <=2.5 and p.mwt <=250 and p.rb <= 575797 13803051261194667109
 drug-like
(#3)

Lipinski, J Pharmacol Toxicol Methods. 2000 Jul-Aug;44(1):235-49.

134415462010-06-17p.xlogp <= 5 and p.mwt <= 500 and p.mwt > 150 and p.rb < 8 and p.psa < 150 and p.n_h_acceptors <= 107974651 N/AN/AN/AN/A
 all-purchasable
(#6)

Purchasable chemical space

206693722010-06-1712470677 N/AN/AN/AN/A
 everything
(#10)

All public molecules in ZINC, including around 1M compounds not in subset #6 that may not be available commercially

351257922010-06-1716641986 0000
 clean-leads
(#11)

As Subset #1, but without 'yuck' compounds.

14427162009-12-11p.xlogp < 3.5 and p.mwt < 350 and p.rb <= 71011096 30296593832247876500
 clean-fragments
(#12)

As Subset #2, but without 'yuck' compounds.

1207272009-11-12p.xlogp <=2.5 and p.mwt <=250 and p.rb <= 558814 N/AN/AN/AN/A
 clean-drug-like
(#13)

As Subset #3, but without 'yuck' compounds.

94975422009-11-13p.xlogp <= 5 and p.mwt <= 500 and p.mwt >=150 and p.n_h_donors <= 5 and p.n_h_acceptors <= 107973055 569344149662352898535
 leads-now
(#21)

Teague, Davis, Leeson, Oprea, Angew Chem Int Ed Engl. 1999 Dec 16;38(24):3743-3748.

19498282010-03-26p.xlogp < 3.5 and p.mwt < 350 and p.rb <= 7205617 N/AN/AN/A14673
 frags-now
(#22)

Carr RA, Congreve M, Murray CW, Rees DC, Drug Discov Today. 2005 Jul 15;10(14):987

3604532010-03-19p.xlogp <=2.5 and p.mwt <=250 and p.rb <= 549113 12097647717187797021
 drugs-now
(#23)

Lipinski, J Pharmacol Toxicol Methods. 2000 Jul-Aug;44(1):235-49.

62807872009-07-03p.xlogp <= 5 and p.mwt <= 500 and p.mwt > 150 and p.rb < 8 and p.psa < 150 and p.n_h_acceptors <= 102223051 2419801005104379616384
 all-now
(#26)

Immediately purchasable chemical space

92400232009-09-083245702 N/AN/AN/AN/A
 goldilocks
(#33)

not too big, not too small, not too polar, not too greasy, uncharged, just right

7226652009-07-08p.net_charge between -1 and 1 and p.mwt between 200 and 300 and p.xlogp between 1 and 3 and p.rb < 650131 N/AN/AN/AN/A
 sarah
(#37)

fragments for model systems

6816932009-07-08p.mwt between 30 and 250295156 N/AN/AN/AN/A
 clean CNS permeable
(#39)

more likely to pass the blood-brain barrier

3445962009-07-03p.psa between 0 and 60 and p.mwt between 150 and 400 and p.xlogp between 1.5 and 2.7215552 9598536494127764289
 flagments
(#77)

between fragments and leads; as requested by denise

9819702009-02-24p.xlogp < 3 and p.mwt < 300 and p.rb < 6651905 N/AN/AN/AN/A
 lugs
(#78)

between leads and drugs; as suggested by denise

54952422009-04-03((p.xlogp < 4 and p.mwt < 400 and p.rb < 10) AND NOT (p.xlogP < 3 and p.mwt < 350 and p.rb<=7))2710556 N/AN/AN/AN/A
 jens
(#79)

NHR docking set

15438432009-10-26p.xlogp<5 and p.mwt between 400 and 500568769 N/AN/AN/AN/A
 missingleads
(#80)

missing greasy leads

6013162009-10-26p.xlogp between 4 and 5 and p.mwt between 350 and 400439644 N/AN/AN/AN/A

The following limits are in effect:
  • Molecules per mol2 or SDF format file: 100,000
  • Entries per flexibase file 20,000
  • Typical size of mol2/sdf/flexibase files: 20 to 100 MB
  • Number of large files that may be downloaded concurrently: 2
In the downloads above, we offer four formats: SMILES, mol2, SDF, and flexibase formats. Most docking programs we know of use mol2. We urge you to not download Flexibaseunless you are using DOCK3.5.54.

We have generated protonated forms in 3 pH ranges. The reference structure ("ref") is a single representative structure at pH 7. The middle pH range ("mid") contains additional protonated forms obtained by titrating protons with pKa between 5.75 and 8.25. The low pH range ("lo") contains additional protonated forms between pH 4.5-7 and the high pH range ("hi") contains additional protonation forms between pH 7-9.5.

Databases are divided into chunks for easier download. mol2 and sdf slices are 100MB or less uncompressed (often 30MB compressed, containing 20K to 30K molecules each Flexibase are 200MB or less uncompressed.

To download all files in the usual pH range (5.75-8.25) use "Usual". To download databases for metalloenzymes use "Metal", which adds the pH range 7-9.5 to usual above. For the entire pH range 4.5-9.5 please use "All".

We also offer tables of Molecular Properties, purchasing information, and Inchi files.

Last updated: Jan 30, 2007 by jji at cgl.ucsf.edu.

A product of BCIRC, the Bioinfomatics and Chemical Informatics Research Center @ UCSF. Last updated Aug 6, 2009. questions and discussion to zinc-fans at docking.org; bug reports to support at docking.org; any other correspondence to comments at docking.org. Terms of use. Privacy policy.