|
|
Materials and MethodsTraining and test data sets Two sets of breast cancer samples were used in this study: 73 samples in the training data set, and 22 samples in the independent test data set. All tissue sections were stained with the PathVysion® Her2 FISH probe set (Vysis, Downers Grove, USA), comprising LSI®-Her2 (orange) and CEP-17 (green), and counterstained with DAPI. The scanning was performed using a Metafer4-MetaCyte microscope scanning system. For all samples, tumour regions suitable for analysis were interactively located using a trackball device to move the scanning stage. Then images were fully automatically captured using a 40x/0.75 dry objective, using the list of positions defined in the initial interactive step. In the Her2 and CEP17 image channels, extended focus images were computed from 9 focus plane images captured with a spacing of 0.75 µm. To provide ground-truth ratio data, all training and test samples were visually scored according to [1]. Finally, regions of interest containing at least 75% tumour cells were interactively defined. Placing of tiles The tile placing algorithm tries to include as much cellular material and as little empty space as possible in the tiles. The first tile is placed at the position within the image which gives the maximum total DAPI intensity in the tile. The position of the second tile is determined using the same criterion, with the additional condition that it does not overlap with the first one. This procedure is continued until the total DAPI intensity of the next potential tile drops below a certain percentage of that of the first tile. HSR classification As mentioned above we are employing different spot counting algorithms for non-HSR and for HSR samples. This makes it necessary to classify the entire sample automatically before spot counting. We do this by analysing the distribution of the object areas in the Her2 channel. Objects can be single diffraction limited FISH spots or HSRs, which are significantly larger. The best discrimination of HSRs was found for the total number of objects larger than 0.7 µm˛, divided by the total number of all objects. Spot counting In non-HSR samples Her2 FISH signals are seen as diffraction limited spots with a diameter of about 3 pixels. The following tile image processing was found to give the best spot counting performance in the training data set: 1. a Gaussian smoothing filter; 2. a TopHat filter for local background correction; 3. a Laplacian sharpening filter for enhancing the spots; 4. another Gaussian smoothing filter; 5. application of the counterstain mask. After this processing the number of objects at 40% relative intensity is used as spot count. In HSR samples all algorithms based on Her2 signal counting gave quite poor results. Therefore we decided to use a total area measurement in the orange channel and estimate the Her2 signal count from it by quadratic regression. The optimum tile image processing we found was: 1: a TopHat filter for local background correction; 2. application of the counterstain mask. The area was then measured at 18% relative intensity. The CEP17 signals are typical centromere signals: they are larger and brighter, but have a greater tendency to be spread out or split. The optimum tile image processing in this case is: 1. a Gaussian smoothing filter; 2. a TopHat filter for local background correction; 3. a Laplacian sharpening filter for enhancing the spots; 4. application of the counterstain mask; 5. a Spot" averaging filter with a mask area of 0.36 µm˛. Then the objects were counted at a relative intensity of 12%. If the distance of two signals was less than 0.5 µm, only one of them was included in the final count.
|
Results and DiscussionThe algorithms described above were applied to all training (73 samples) and test (22 samples) data. Of the 95 samples, 19 (20.0%) were rejected due to slide quality. Of the remaining 76 samples, 73 (96%) were correctly classified as amplified" or not amplified", and 3 (4%) were classified false negative. The results are shown graphically in figure 1. Compared to manual scoring the errors of the automatic system are comparable to the discordance between two different observers. Nevertheless there are continuing efforts to reduce the errors and the sample reject rate. Of the 3 false negative classifications, 2 are very close to the threshold ratio value of 2.0. They would not cause a wrong clinical result as all ratios within an interval around 2.0 (e.g. between 1.8 and 2.2) are interactively checked before the final ratio is reported. Figure 2 shows a part of an example Metafer4 tile gallery of an unamplified sample, Figure 3 of an amplified sample with HSR. The number displayed in red for each tile is the Her2 spot count, the number in green the CEP17 spot count, and the number in white the Her2/CEP17 ratio value. |
ReferencesPathVysion Her2 Package Insert. |
(c) 2004 by MetaSystems