Data Preparation > Database > ISI > Extract Document Source Citation Network (Core Only)
Extracts the Source citation network from an ISI database.
Each Source in the input database is represented by a node. An edge connects the nodes for two Sources if and only if a Document published in one Source cites a Document published in the other. (Citations between Sources are therefore derived from citations between Documents.)
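The edge rule can be illustrated with a minimal Python sketch. This is not the tool's implementation: the inputs `doc_source` (a mapping from each Document to its Source) and `citations` (a list of citing/cited Document pairs) are hypothetical in-memory stand-ins for the database tables. Whether the tool keeps edges from a Source to itself is not stated here; the sketch keeps them.

```python
def source_citation_edges(doc_source, citations):
    """Return the set of (citing Source, cited Source) pairs implied by Document citations.

    doc_source and citations are hypothetical inputs, not the tool's data model.
    """
    edges = set()
    for citing_doc, cited_doc in citations:
        citing_source = doc_source.get(citing_doc)
        cited_source = doc_source.get(cited_doc)
        # Skip citations whose Documents have no known Source.
        if citing_source is None or cited_source is None:
            continue
        edges.add((citing_source, cited_source))
    return edges


# Illustrative data only.
doc_source = {"d1": "Nature", "d2": "Science", "d3": "Science"}
citations = [("d1", "d2"), ("d1", "d3"), ("d3", "d1")]
edges = source_citation_edges(doc_source, citations)
```

Several Document-level citations between the same pair of Sources collapse into a single edge here, because the sketch only records whether a link exists.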
Core Source vs. Non-Core Source
The algorithm distinguishes between Sources contained in your dataset and Sources in general. A Source that is in your dataset is called a "Core Source".
The output network of this algorithm contains nodes representing Core Sources only. For an algorithm that also represents Non-Core Sources, see Extract Document Source Citation Network (Core and References).
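As a rough illustration of the Core Only restriction, the sketch below drops every edge that touches a Source outside the dataset. The function name `keep_core_edges` and the sample Source names are hypothetical, and the tool itself may apply the restriction differently (for example, inside its database query).

```python
def keep_core_edges(edges, core_sources):
    """Keep only edges whose two endpoints are both Core Sources (hypothetical helper)."""
    return {(a, b) for (a, b) in edges if a in core_sources and b in core_sources}


# Illustrative data only: one Source exists only as a cited reference.
core_sources = {"Nature", "Science"}
edges = {("Nature", "Science"), ("Nature", "Some Uncatalogued Journal")}
core_edges = keep_core_edges(edges, core_sources)
```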
The output network will include the following data and metadata:
- Node (Source)
  - All data from the Sources table.
  - A generated, prettified label identifying this Source.
- Edge (Citation)
  - The number of Documents in the target node's Source that cited Documents in the source node's Source.
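The edge attribute can be sketched as a count of distinct citing Documents. The example below is an assumption-laden illustration, not the tool's code: it follows the wording above literally (the source node is the cited Source, the target node is the citing Source), and `doc_source` and `citations` are hypothetical inputs.

```python
from collections import defaultdict


def edge_weights(doc_source, citations):
    """Map (source node, target node) to the number of distinct citing Documents.

    Following the description above, the source node is the cited Source and
    the target node is the citing Source. Inputs are hypothetical.
    """
    citing_docs = defaultdict(set)
    for citing_doc, cited_doc in citations:
        source_node = doc_source[cited_doc]
        target_node = doc_source[citing_doc]
        # A set ensures each citing Document is counted once per edge.
        citing_docs[(source_node, target_node)].add(citing_doc)
    return {edge: len(docs) for edge, docs in citing_docs.items()}


# Illustrative data only: d1 cites two Science Documents, d4 cites one.
doc_source = {"d1": "Nature", "d2": "Science", "d3": "Science", "d4": "Nature"}
citations = [("d1", "d2"), ("d1", "d3"), ("d4", "d2")]
weights = edge_weights(doc_source, citations)
```

In this sample, d1 contributes once to the count even though it cites two Documents in the same Source, since the attribute counts citing Documents rather than individual citations.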
Load an ISI file into the tool, then create a database from it using the ISI database loader.
It is strongly recommended that the database be cleaned before extracting any citation networks from it.
For a quick analysis of a small dataset, you may wish to merge Author entities with identical names. For a scientifically sound analysis of a larger dataset, you can find Author entity merging suggestions (or manually set your own merging orders from scratch) and perform the merge.
Then, you will probably want to merge Source entities according to recognized variants.
Finally, you must match References to Documents in your dataset; otherwise, there are no citations to analyze.
The specific query run by the tool can be found in the source code.
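For readers who want a feel for what such a query might look like, here is a self-contained sketch using Python's sqlite3 module. It is not the tool's query: the schema (tables `sources`, `documents`, `citations` and their columns) is invented for illustration, and the real table and column names should be taken from the source code.

```python
import sqlite3

# Hypothetical schema and sample rows; the tool's real database differs.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE sources (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE documents (id INTEGER PRIMARY KEY, source_id INTEGER REFERENCES sources(id));
CREATE TABLE citations (citing_doc_id INTEGER, cited_doc_id INTEGER);
INSERT INTO sources VALUES (1, 'Nature'), (2, 'Science');
INSERT INTO documents VALUES (10, 1), (11, 2), (12, 2), (13, 1);
INSERT INTO citations VALUES (10, 11), (10, 12), (13, 11);
""")

# One row per Source pair, weighted by the number of distinct citing Documents.
rows = conn.execute("""
SELECT cited.source_id  AS source_node,
       citing.source_id AS target_node,
       COUNT(DISTINCT citing.id) AS weight
FROM citations c
JOIN documents citing ON citing.id = c.citing_doc_id
JOIN documents cited  ON cited.id  = c.cited_doc_id
GROUP BY cited.source_id, citing.source_id
ORDER BY source_node, target_node
""").fetchall()
conn.close()
```

Joining `documents` twice, once for the citing side and once for the cited side, is what lifts Document-level citations to Source-level edges in this sketch.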