Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

improve MIC-wikidata alignment #8

Open
VladimirAlexiev opened this issue Aug 19, 2022 · 3 comments
Open

improve MIC-wikidata alignment #8

VladimirAlexiev opened this issue Aug 19, 2022 · 3 comments

Comments

@VladimirAlexiev
Copy link

I added MIC to Wikidata. Last update was maybe 1y ago. Spent a lot of time on matching existing exchanges.
I think most items are ok, but I'm sure there are omissions and merges that still need to be done.

  • Do you get your alignments skos:closeMatch wd:Qnnn directly from WD?
  • Would you like to help with improving the situation in WD?
  • Below is some info on ambiguous/duplicate codes, but I have a lot more files locally.
    • In particular, we need to check which current MIC codes are still missing from WD.
      A year ago they were COTC|ECHO|JSES|JSJX|LMEC|MIBG|NMRA|OTCN|TPSL|TSGI|XSWB|XZAP

MIC and Crunchbase Exchange IDs on Wikidata

Table of Contents

Results and counts as per 17-Aug-2022

MIC Codes

Duplicate MIC

Two MICs per exchange: https://w.wiki/5agv: 7

item itemLabel mic1 mic2
wd:Q21072594 Equilend EQIE EQLD
wd:Q5973741 FXCM FXCM FXGB
wd:Q25245262 Hudson River Trading HRTF HRTX
wd:Q795936 Oddo BHF ODDO ODOC
wd:Q7829724 Tower Research TRCX TRSI
wd:Q60742651 XTX Markets XTXE XTXM
wd:Q66320 over-the-counter trading BILT XOFF

Ambiguous MIC

One MIC for two exchanges: https://w.wiki/5ah$ : 3 (now fixed all)

Crunchbase Exchanges

Out of 164 CB exchange codes.

Ambiguous Crunchbase Code

Same code for two exchanges: https://w.wiki/5aYS : 1

item1 item1Label mic1 item2 item2Label mic2 cb
wd:Q1752885 Mongolian Stock Exchange XULA wd:Q43080281 Metropolitan Stock Exchange MCXX mse

Duplicate Crunchbase Code

Two CB codes per exchange: https://w.wiki/5aYP : 8 (this is ok)

item itemLabel cb1 cb2 mic
wd:Q5013179 Canadian Securities Exchange cnsx cse XCNQ
wd:Q43080281 Metropolitan Stock Exchange mse msei MCXX
wd:Q151139 Frankfurt Stock Exchange fra fwb FRAA
wd:Q496672 Hong Kong Stock Exchange hkg sehk XHKG
wd:Q661834 SIX Swiss Exchange six swx XSWX
wd:Q824533 Berne eXchange bx sbx XBRN
wd:Q846626 NYSE American amex nysemkt XASE
wd:Q2068453 Belarusian Currency and Stock Exchange bvfb jsc BCSE

Crunchbase Without MIC

Crunchbase exchanges without MIC: https://w.wiki/5aYM : 13

item itemLabel cb
wd:Q111263879 Afghanistan Stock Exchange afx
wd:Q28129219 ALTX East Africa Exchange altx
wd:Q4670553 Abuja Securities and Commodities Exchange asce
wd:Q1003245 Bucharest Stock Exchange bvb
wd:Q5072409 Channel Islands Stock Exchange cise
wd:Q702192 Eurex eurex
wd:Q111263885 Iran Energy Exchange irenex
wd:Q111263892 Lusaka Securities Exchange luse
wd:Q23308122 National Equities Exchange and Quotations neeq
wd:Q7374819 Royal Securities Exchange of Bhutan rsebl
wd:Q7914256 Vancouver Stock Exchange vse
wd:Q20992132 Yangon Stock Exchange ysx
wd:Q111263909 Zambian Commodity Exchange zamace
@hroptatyr
Copy link
Contributor

Thanks, for the overview. I added some of the obvious ones, XVSE (Q7914256), XBSE (Q1003245), XLUS (Q111263892). But stopped there because items appear to need merging or scope clarification.

Lusaka Stock is now Lusaka Securities, for the same reason you cannot find the Zambian Commodity Exchange (ZAMACE), it got merged into LuSE, like years ago.

For XBSE there's Q93358786 as well as Q1003245. Here the question is how to proceed, BVB operates the spot market (equities, bonds) as well as a derivatives market (mic XBSD), so the uniqueness constraint makes no sense. I chose to put XBSE in Q1003245 because XBSE is also the operator of both segments.

I'm willing to help but there's not a lot of guidance on wd on how to reflect which resource covers what.

Oh, Oddo BHF is quite the same, they use ODDO for everything except OTC or RFQ trades, where they use ODOC.

@VladimirAlexiev
Copy link
Author

VladimirAlexiev commented Aug 24, 2022

  • linked Zambian Commodity Exchange to Lusaka Stock Exchange
  • merged Q93358786 to Q1003245 as "Stock exchange Bucharest, Romania and its main SPOT REGULATED MARKET"
  • ODOC "ODDO CONTREPARTIE" should be split from ODDO "ODDO BHF", but I haven't done it.

@hroptatyr more important are the omissions.
Total unique MICs in WD: https://w.wiki/5cRB : 1944.
Total uses as main statement: https://www.wikidata.org/wiki/Property_talk:P7534 : 1948.

This should be compared to the total in the MIC distribution, which are:

  • 2021-09: 2069
  • 2021-11: 2469
  • 2022-08: 2536

I've added these counts to https://www.wikidata.org/wiki/Property:P7534#P4876.
So now https://www.wikidata.org/wiki/Property_talk:P7534 shows "1,948 out of 2,536 (77% complete)".

Would you like to help me match and import the new MIC records to WD, so we have 100% coverage again? I do this with OpenRefine. Actually I'll talk to some colleagues.

@hroptatyr
Copy link
Contributor

Oh certainly I would.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants