Image Ranking

Source code and test script of SILA image ranking module.

Installation

Install Git.
Install Git Large File Storage.
Install Docker.

Check the project out from GitLab.

git lfs clone --branch content-ranking https://github.com/danielmoreira/sciint.git sci-ranking

Build the Docker container. In a terminal, execute:

cd sci-ranking; docker build . -t sci-ranking:latest

Create an IO folder for the container input-output. Please change the location of the folder within your machine accordingly.
```
export RANK_IO=~/RANK_IO; mkdir -p $RANK_IO
```

Start the container.

docker run --rm -ti -v $RANK_IO:/ranking/io --name sci-ranking sci-ranking:latest

Test Execution

Download the test data zip file. Contact Daniel Moreira to get and unlock it.
Move the test data to the container IO folder (see item 6 of the section above).

Run (in a terminal):

docker exec sci-ranking /ranking/01_query_all_images.sh

Get the output data from the container IO folder.

Input and output data are explained in the following.

Test (Input and Output) Data

The test data consists of 2,843 scientific image panels extracted from 48 distinct articles. Each one of the image panels will be used as a query to retrieve, among the remaining 2,842 image panels, which are the top-500 most visually similar panels to the query. Please contact Daniel Moreira to obtain and unlock the test data.

All the 2,843 queries will be processed individually and sequentially, using all the CPU cores available in the host machine. The 2,843 queries will lead to the computation of 2,843 image panel ranks, which are stored as text files whose paths are the same of their respective queries added with the ".txt" extension. Each rank file will store a list of up-to 500 image panels (among the 2,842 available ones) through their file paths, one file path per line, from the most to the least similar image panel to the query.

In the case of image panel "10.1002_cncr.21731/figures-panels/fig1_000.png" as the query, for example, the respective rank will be stored in "10.1002_cncr.21731/figures-panels/fig1_000.png.txt" and its content will look like:

rank_images/10.1002_cncr.21731/figures-panels/fig1_000.png,1.0000000000
rank_images/10.1002_cncr.21731/figures-panels/fig4_001.png,0.0105644072
rank_images/10.1124_mol.107.041350/figures-panels/fig2_000.png,0.0050476437
rank_images/10.1002_cncr.21731/figures-panels/fig2_000.png,0.0040741775
rank_images/10.1158_0008-5472.CAN-04-4604/figures-panels/fig4_004.png,0.0027281176
# (...)

Metrics

To compare the computed and ground-truth image ranks, we calculate the precision at the top-N retrieved images (P@N, with N in {1, 5, 10}, hence P@1, P@5, and P@10), for each one of the queries.

By definition, P@N belongs to real interval [0.0, 1.0], and we want it as close to 1.0 as possible. An implementation of these metrics is properly made available within src/compute_precision.py.

Metric Collection

To assess the metrics above and obtain the mean and standard deviation for each one of them with respect to the test data:

Execute the test (described above) and generate all the 2,843 output ranks.
Run (in a terminal, with the image ranking container properly started):
```
docker exec sci-ranking /ranking/02_eval_all_ranks.sh
```

The result will be:

Metric	Mean (Std)
P@1	0.5860007034822371 (0.4925483519417767)
P@5	0.41238128737249385 (0.31746233650357253)
P@10	0.3211748153359128 (0.2365817038758192)

Cite this Work

Please cite as:

Moreira, D., Cardenuto, J.P., Shao, R. et al. SILA: a system for scientific image analysis. Nature Scientific Reports 12 (18306), 2022. https://doi.org/10.1038/s41598-022-21535-3

@article{sila,
   author = {Moreira, Daniel and Cardenuto, João Phillipe and Shao, Ruiting and Baireddy, Sriram and Cozzolino, Davide and Gragnaniello, Diego and Abd‑Almageed, Wael and Bestagini, Paolo and Tubaro, Stefano and Rocha, Anderson and Scheirer, Walter and Verdoliva, Luisa and Delp, Edward},
   title = {{SILA: a system for scientifc image analysis}},
   journal = {Nature Scientific Reports},
   year = 2022,
   number = {12},
   volume = {18306},
   pages = {1--15}
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
ranking_data		ranking_data
src		src
01_query_all_images.sh		01_query_all_images.sh
02_eval_all_ranks.sh		02_eval_all_ranks.sh
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
ranking-example.png		ranking-example.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ranking_data

ranking_data

src

src

01_query_all_images.sh

01_query_all_images.sh

02_eval_all_ranks.sh

02_eval_all_ranks.sh

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

ranking-example.png

ranking-example.png

requirements.txt

requirements.txt

Repository files navigation

Image Ranking

Installation

Test Execution

Test (Input and Output) Data

Metrics

Metric Collection

Cite this Work

About

Releases 1

Packages

License

danielmoreira/sciint

Folders and files

Latest commit

History

Repository files navigation

Image Ranking

Installation

Test Execution

Test (Input and Output) Data

Metrics

Metric Collection

Cite this Work

About

Resources

License

Stars

Watchers

Forks