2.51 score from hupso.pl for:
ouseful.info



HTML Content


Titleouseful.info, the blog... | trying to find useful things to do with emerging technologies in open education and data journalism

Length: 127, Words: 20
Description trying to find useful things to do with emerging technologies in open education and data journalism

Length: 99, Words: 16
Keywords pusty
Robots
Charset UTF-8
Og Meta - Title exist
Og Meta - Description exist
Og Meta - Site name exist
Tytuł powinien zawierać pomiędzy 10 a 70 znaków (ze spacjami), a mniej niż 12 słów w długości.
Meta opis powinien zawierać pomiędzy 50 a 160 znaków (łącznie ze spacjami), a mniej niż 24 słów w długości.
Kodowanie znaków powinny być określone , UTF-8 jest chyba najlepszy zestaw znaków, aby przejść z powodu UTF-8 jest bardziej międzynarodowy kodowaniem.
Otwarte obiekty wykresu powinny być obecne w stronie internetowej (więcej informacji na temat protokołu OpenGraph: http://ogp.me/)

SEO Content

Words/Characters 5892
Text/HTML 25.05 %
Headings H1 8
H2 8
H3 14
H4 0
H5 0
H6 0
H1
ouseful.info, the blog...
tracking down data files associated with parliamentary business
data cleaning – finding near matches in names
talking to developers and civic hackers on their own terms…
tinkering with parliament data apis: commons written questions and parliamentary written answers
experimenting with sankey diagrams in r and python
grouping numbers that are nearly the same – casual clustering
transparency in parliament… and in data journalism?
H2
freedom of information related datasets
written questions
written statements
deposited papers
committee papers
parliamentary research briefings
summary
so what?
H3
sources of “interests” data from parliament
others sources of data relating to members’ parliamentary and government activities
other sources of information about members’ external interests
contact
email subscription
my other blogs
digital worlds blog
custom search engines
page hacks
@psychemedia tweets
tumbling…
recent posts
top posts
archives
H4
H5
H6
strong
ws-attachments
qna-attachments
nearly
b
i
em ws-attachments
qna-attachments
nearly
Bolds strong 3
b 0
i 0
em 3
Zawartość strony internetowej powinno zawierać więcej niż 250 słów, z stopa tekst / kod jest wyższy niż 20%.
Pozycji używać znaczników (h1, h2, h3, ...), aby określić temat sekcji lub ustępów na stronie, ale zwykle, użyj mniej niż 6 dla każdego tagu pozycje zachować swoją stronę zwięzły.
Styl używać silnych i kursywy znaczniki podkreślić swoje słowa kluczowe swojej stronie, ale nie nadużywać (mniej niż 16 silnych tagi i 16 znaczników kursywy)

Statystyki strony

twitter:title pusty
twitter:description pusty
google+ itemprop=name pusty
Pliki zewnętrzne 24
Pliki CSS 4
Pliki javascript 20
Plik należy zmniejszyć całkowite odwołanie plików (CSS + JavaScript) do 7-8 maksymalnie.

Linki wewnętrzne i zewnętrzne

Linki 188
Linki wewnętrzne 6
Linki zewnętrzne 182
Linki bez atrybutu Title 160
Linki z atrybutem NOFOLLOW 0
Linki - Użyj atrybutu tytuł dla każdego łącza. Nofollow link jest link, który nie pozwala wyszukiwarkom boty zrealizują są odnośniki no follow. Należy zwracać uwagę na ich użytkowania

Linki zewnętrzne

ouseful.info, the blog... https://blog.ouseful.info/
about https://blog.ouseful.info/about/
editorial policy https://blog.ouseful.info/editorial-policy/
search https://blog.ouseful.info/search/
tracking down data files associated with parliamentary business https://blog.ouseful.info/2017/03/23/tracking-down-data-files-associated-with-parliamentary-business/
using foi response on whatdotheyknow as an “as if” open datastore https://blog.ouseful.info/2012/04/28/the-foi-route-to-real-fake-open-data-via-whatdotheyknow/
foi responses http://www.parliament.uk/site-information/foi/
https://www.parliament.uk/documents/foi/ https://www.parliament.uk/documents/foi/
- https://www.google.co.uk/search?q=(filetype%3axls+or+filetype%3acsv+or+filetype%3axlsx)+site%3ahttps%3a%2f%2fwww.parliament.uk%2fdocuments%2ffoi
written questions http://www.parliament.uk/business/publications/written-questions-answers-statements/written-questions-answers/
http://qna.files.parliament.uk/qna-attachments http://qna.files.parliament.uk/qna-attachments
- https://www.google.co.uk/search?q=(filetype%3axls+or+filetype%3acsv+or+filetype%3axlsx)+site%3ahttp%3a%2f%2fqna.files.parliament.uk%2fqna-attachments
numeric key for a particular question https://blog.ouseful.info/2017/03/18/tinkering-with-parliament-data-apis-commons-written-questions-and-parliamentary-written-answers/
written answers api http://explore.data.parliament.uk/?endpoint=commonsansweredquestions
http://qna.files.parliament.uk/qna-attachments/454264/original/28152%20-%20table.xlsx http://qna.files.parliament.uk/qna-attachments/454264/original/28152%20-%20table.xlsx
- http://www.parliament.uk/writtenstatements
http://qna.files.parliament.uk/ http://qna.files.parliament.uk/
http://explore.data.parliament.uk/ http://explore.data.parliament.uk/
- https://www.parliament.uk/depositedpapers
http://data.parliament.uk/depositedpapers/files/ http://data.parliament.uk/depositedpapers/files/
example http://data.parliament.uk/writtenevidence/committeeevidence.svc/evidencedocument/education-committee/purpose-and-quality-of-education-in-england/written/46460.html
- http://researchbriefings.parliament.uk/
- http://researchbriefings.parliament.uk/researchbriefing/summary/sn06643
http://researchbriefings.files.parliament.uk/ http://researchbriefings.files.parliament.uk/
research briefings api http://explore.data.parliament.uk/?learnmore=research%20briefings
- http://lda.data.parliament.uk/researchbriefings.json?identifier=sn06643
- http://lda.data.parliament.uk/researchbriefings/345630.json
tony hirst https://blog.ouseful.info/author/psychemedia/
leave a comment https://blog.ouseful.info/2017/03/23/tracking-down-data-files-associated-with-parliamentary-business/#respond
anything you want https://blog.ouseful.info/category/anything-you-want/
parlidata https://blog.ouseful.info/tag/parlidata/
data cleaning – finding near matches in names https://blog.ouseful.info/2017/03/20/data-cleaning-finding-near-matches-in-names/
what nationality did you say you were, again? https://blog.ouseful.info/2016/11/14/what-nationality-did-you-say-you-were-again/
fuzzyset https://github.com/axiak/fuzzyset
fco register of country names https://country.register.gov.uk/
tony hirst https://blog.ouseful.info/author/psychemedia/
2 comments https://blog.ouseful.info/2017/03/20/data-cleaning-finding-near-matches-in-names/#comments
anything you want https://blog.ouseful.info/category/anything-you-want/
talking to developers and civic hackers on their own terms… https://blog.ouseful.info/2017/03/19/talking-to-developers/
lords amendments http://lordsamendments.parliament.uk/
explore.data.parliament.uk http://explore.data.parliament.uk/
james bridle http://booktwo.org/
the new aesthetic http://new-aesthetic.tumblr.com/
- http://booktwo.org/notebook/welcome-js/
welcome.js http://booktwo.org/notebook/welcome-js/
here https://blog.ouseful.info/2017/03/14/computer-spirits/
meet the artist using ritual magic to trap self-driving cars https://creators.vice.com/en_us/article/meet-the-artist-using-ritual-magic-to-trap-self-driving-cars
search engine powered courses… https://blog.ouseful.info/2011/09/15/search-engine-powered-courses/
tony hirst https://blog.ouseful.info/author/psychemedia/
leave a comment https://blog.ouseful.info/2017/03/19/talking-to-developers/#respond
anything you want https://blog.ouseful.info/category/anything-you-want/
tinkering with parliament data apis: commons written questions and parliamentary written answers https://blog.ouseful.info/2017/03/18/tinkering-with-parliament-data-apis-commons-written-questions-and-parliamentary-written-answers/
recipe https://github.com/psychemedia/parlihacks/blob/master/notebooks/writtenquestions.ipynb
output https://psychemedia.github.io/parlihacks/writtenquestions.html
written questions api http://explore.data.parliament.uk/?learnmore=commons%20written%20questions
parliamentary questions http://www.parliament.uk/business/publications/written-questions-answers-statements/written-questions-answers/?page=1&max=20&questiontype=allquestions&house=commons%2clords&use-dates=true&answered-from=2014-06-04&answered-to=2017-03-18&uin=24161
parliamentary questions answered api http://explore.data.parliament.uk/?learnmore=parliamentary%20questions%20answered
explore.data.parliament.uk http://explore.data.parliament.uk/
tony hirst https://blog.ouseful.info/author/psychemedia/
leave a comment https://blog.ouseful.info/2017/03/18/tinkering-with-parliament-data-apis-commons-written-questions-and-parliamentary-written-answers/#respond
anything you want https://blog.ouseful.info/category/anything-you-want/
parlidata https://blog.ouseful.info/tag/parlidata/
experimenting with sankey diagrams in r and python https://blog.ouseful.info/2017/03/17/experimenting-with-sankey-diagrams-in-r-and-python/
visualising migration between the countries of the uk http://olihawkins.com/2017/03/1
internal migration flows in the uk http://olihawkins.com/visualisation/8
- http://olihawkins.com/visualisation/8
googlevis https://www.rdocumentation.org/packages/googlevis/versions/0.6.0/topics/gvissankey
google chart tools https://developers.google.com/chart/interactive/docs/gallery/sankey
d3.js https://bost.ocks.org/mike/sankey/
rcharts sankey diagram demo by @timelyporfolio https://github.com/timelyportfolio/rcharts_d3_sankey
sankeyd3 https://github.com/fbreitwieser/sankeyd3
here https://github.com/psychemedia/parlihacks/blob/master/rmd/sankeyflow.rmd
here https://psychemedia.github.io/parlihacks/sankeyflow.html
ipysankey widget https://github.com/ricklupton/ipysankeywidget
here https://github.com/psychemedia/parlihacks/blob/master/notebooks/migrantflow.ipynb
here https://psychemedia.github.io/parlihacks/migrantflow.html
tony hirst https://blog.ouseful.info/author/psychemedia/
2 comments https://blog.ouseful.info/2017/03/17/experimenting-with-sankey-diagrams-in-r-and-python/#comments
anything you want https://blog.ouseful.info/category/anything-you-want/
rstats https://blog.ouseful.info/category/syndication/rstats/
jupyter https://blog.ouseful.info/tag/jupyter/
parlidata https://blog.ouseful.info/tag/parlidata/
grouping numbers that are nearly the same – casual clustering https://blog.ouseful.info/2017/03/15/grouping-numbers-that-are-nearly-the-same-casual-clustering/
via stackoverflow http://stackoverflow.com/a/14783998/454773
tony hirst https://blog.ouseful.info/author/psychemedia/
3 comments https://blog.ouseful.info/2017/03/15/grouping-numbers-that-are-nearly-the-same-casual-clustering/#comments
anything you want https://blog.ouseful.info/category/anything-you-want/
data https://blog.ouseful.info/category/data/
tinkering https://blog.ouseful.info/category/tinkering/
data https://blog.ouseful.info/tag/data/
wrc https://blog.ouseful.info/tag/wrc/
transparency in parliament… and in data journalism? https://blog.ouseful.info/2017/03/15/transparency-in-parliament-and-in-data-journalism/
parliament ltd http://www.hive.co.uk/product/martin-williams/parliament-ltd--a-journey-to-the-dark-heart-of-british-politics/20176979
register of members’ financial interests http://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/parliamentary-commissioner-for-standards/registers-of-interests/register-of-members-financial-interests/
maxharlow/scrape-members-financial-interests https://github.com/maxharlow/scrape-members-financial-interests
register of lords’ interests http://www.parliament.uk/mps-lords-and-offices/standards-and-financial-interests/house-of-lords-commissioner-for-standards-/register-of-lords-interests/
register of interests of members’ secretaries and research assistants https://www.publications.parliament.uk/pa/cm/cmsecret/staff-01.htm
register of journalists’ interests https://www.publications.parliament.uk/pa/cm/cmjournl/journalists.htm
register of all-party groups https://www.publications.parliament.uk/pa/cm/cmallparty/170106/contents.htm
here https://blog.ouseful.info/2012/12/17/organisations-providing-benefits-to-all-party-groups-part-1/
hansard speeches and sentiment http://evanodell.com/datasets/hansard-data/
r code is supplied https://github.com/evanodell/hansard-data
the public whip http://www.publicwhip.org.uk/project/data.php
scraping ministers’ meetings https://blog.ouseful.info/2010/11/12/practical-data-scraping-uk-government-transparency-data-ministers-meetings/
cabinet office: ministers’ transparency publications https://www.gov.uk/government/collections/ministers-transparency-publications
office of the registrar of consultant lobbyists http://registrarofconsultantlobbyists.org.uk/
donations and loans to individuals (including mps) http://www.electoralcommission.org.uk/find-information-by-subject/political-parties-campaigning-and-donations/donations-and-loans-to-other-individuals-and-organisations
candidate spending and donations at elections http://www.electoralcommission.org.uk/find-information-by-subject/political-parties-campaigning-and-donations/candidate-spending-and-donations-at-elections
listed as a company officer https://beta.companieshouse.gov.uk/
person of significant control http://download.companieshouse.gov.uk/en_pscdata.html
the insolvency service https://www.insolvencydirect.bis.gov.uk/iesdatabase/viewdirectorsummary-new.asp
individual insolvency https://www.insolvencydirect.bis.gov.uk/eiir/
ico data protection register https://ico.org.uk/esdwebpages/entry/z4972862
import[ing] data from the charity commission data extract https://github.com/ncvo/charity-commission-extract
advisory committee on business appointments (acoba) https://www.gov.uk/government/organisations/advisory-committee-on-business-appointments
appointments taken up by former ministers https://www.gov.uk/government/collections/appointments-taken-up-by-former-ministers
appointments taken up by former crown servants https://www.gov.uk/government/collections/appointments-taken-up-by-former-crown-servants
wanting to be able to hide, or at least embargo, their requests https://github.com/mysociety/alaveteli/issues/8
other than me of course;-) https://blog.ouseful.info/2017/01/25/data-journalism-units-on-github/
tony hirst https://blog.ouseful.info/author/psychemedia/
leave a comment https://blog.ouseful.info/2017/03/15/transparency-in-parliament-and-in-data-journalism/#respond
anything you want https://blog.ouseful.info/category/anything-you-want/
ddj https://blog.ouseful.info/tag/ddj/
older posts https://blog.ouseful.info/page/2/
- http://creativecommons.org/licenses/by/3.0/
tony hirst http://blog.ouseful.info
follow @psychemedia https://twitter.com/psychemedia
- https://plus.google.com/100095426689697101649/
bookmarks http://pinboard.in/u:psychemedia
presentations http://slideshare.com/psychemedia
- https://blog.ouseful.info/feed
subscribe in a reader https://blog.ouseful.info/feed
f1datajunkie blog http://f1datajunkie.blogspot.com
digital worlds blog http://digitalworlds.wordpress.com
visual gadgets blog http://visualgadgets.blogspot.com
- https://digitalworlds.wordpress.com/feed/
digital worlds blog https://digitalworlds.wordpress.com
augmented reality and autonomous vehicles – enabled by the same technologies? https://digitalworlds.wordpress.com/2016/09/15/augmented-reality-and-autonomous-vehicles-enabled-by-the-same-technologies/
using cameras to capture objects as well as images https://digitalworlds.wordpress.com/2016/09/15/using-cameras-to-capture-objects-as-well-as-images/
interlude – ginger facial rigging model https://digitalworlds.wordpress.com/2016/09/13/interlude-ginger-facial-rigging-model/
recap – enabling the impossible https://digitalworlds.wordpress.com/2016/09/12/interval-feature-enabling-the-impossible/
the photorealistic effect… https://digitalworlds.wordpress.com/2016/09/08/the-photorealistic-effect/
churnalism times - polls (search recent polls/surveys) https://cse.google.com/cse/publicurl?cx=016419300868826941330:ewbi9skvnmq
churnalism times (search press releases) https://cse.google.com/cse/publicurl?cx=016419300868826941330:wvfrmcn2oxc&q=
coursedetective http://coursedetective.co.uk
uk university libraries http://ouseful.open.ac.uk/i/mobcse.html
ouseful http://www.google.com/coop/cse?cx=009190243792682903990:2ke2vucb8nm
how do i? http://ouseful.open.ac.uk/howdoi
@psychemedia tweets http://twitter.com/psychemedia
twitter.com/alberto_cottic… https://twitter.com/alberto_cottica/status/846810917144842240
8 hours ago http://twitter.com/psychemedia/statuses/846811750259462145
8 hours ago http://twitter.com/psychemedia/statuses/846806602065817600
15 hours ago http://twitter.com/psychemedia/statuses/846704843486105604
follow @psychemedia http://twitter.com/psychemedia
- http://psychemedia.tumblr.com/rss
tumbling… http://psychemedia.tumblr.com/
"so while the broadcasters (unlike the press) may have passed the test of impartiality during the..." http://psychemedia.tumblr.com/post/146446797887
"finding the story in 150 million rows of data" http://psychemedia.tumblr.com/post/67741985219
"to live entirely in public is a form of solitary confinement." http://psychemedia.tumblr.com/post/66582526150
icts and anti-corruption: theory and examples | tim's blog http://psychemedia.tumblr.com/post/66174594908
"instead of getting more context for decisions, we would get less; instead of seeing the logic..." http://psychemedia.tumblr.com/post/66084314090
"bbc r&d is now winding down the current uas activity and this conference marked a key stage in..." http://psychemedia.tumblr.com/post/66082591558
"the vc/ipo money does however distort the market, look at amazon’s ‘profit’..." http://psychemedia.tumblr.com/post/66082099095
"newsreader will process news in 4 different languages when it comes in. it will extract what..." http://psychemedia.tumblr.com/post/64694672362
governance | the openspending blog http://psychemedia.tumblr.com/post/64679109300
"the reality of news media is that once the documents are posted online, they lose a lot of value. a..." http://psychemedia.tumblr.com/post/64677949172
tracking down data files associated with parliamentary business https://blog.ouseful.info/2017/03/23/tracking-down-data-files-associated-with-parliamentary-business/
data cleaning – finding near matches in names https://blog.ouseful.info/2017/03/20/data-cleaning-finding-near-matches-in-names/
talking to developers and civic hackers on their own terms… https://blog.ouseful.info/2017/03/19/talking-to-developers/
tinkering with parliament data apis: commons written questions and parliamentary written answers https://blog.ouseful.info/2017/03/18/tinkering-with-parliament-data-apis-commons-written-questions-and-parliamentary-written-answers/
experimenting with sankey diagrams in r and python https://blog.ouseful.info/2017/03/17/experimenting-with-sankey-diagrams-in-r-and-python/
seven graphical interfaces to docker https://blog.ouseful.info/2015/08/10/seven-graphical-interfaces-to-docker/
using google spreadsheets like a database - the query formula https://blog.ouseful.info/2010/01/19/using-google-spreadsheets-like-a-database-the-query-formula/
seven ways of running ipython / jupyter notebooks https://blog.ouseful.info/2014/12/12/seven-ways-of-running-ipython-notebooks/
displaying events from multiple google calendars in a single embedded calendar view https://blog.ouseful.info/2008/12/04/displaying-events-from-multiple-google-calendars-in-a-single-embedded-calendar-view/
experimenting with sankey diagrams in r and python https://blog.ouseful.info/2017/03/17/experimenting-with-sankey-diagrams-in-r-and-python/
creating your own results charts for surveys created with google forms https://blog.ouseful.info/2009/02/17/creating-your-own-results-charts-for-surveys-created-with-google-forms/
merging datasets with common columns in google refine https://blog.ouseful.info/2011/05/06/merging-datesets-with-common-columns-in-google-refine/
updating google calendars from a google spreadsheet https://blog.ouseful.info/2010/03/04/maintaining-google-calendars-from-a-google-spreadsheet/
blog at wordpress.com. https://wordpress.com/?ref=footer_blog
ouseful.info, the blog… https://blog.ouseful.info/
blog at wordpress.com. https://wordpress.com/?ref=footer_blog

Zdjęcia

Zdjęcia 38
Zdjęcia bez atrybutu ALT 35
Zdjęcia bez atrybutu TITLE 38
Korzystanie Obraz ALT i TITLE atrybutu dla każdego obrazu.

Zdjęcia bez atrybutu TITLE

https://ouseful.files.wordpress.com/2017/03/filetype_xlsx_site_parliament_uk_-_google_search.png?w=629&h=409
https://ouseful.files.wordpress.com/2017/03/filetype_xls_or_filetype_csv_or_filetype_xlsx__site_https___www_parliament_uk_documents_foi__-_google_search.png?w=535&h=446
https://ouseful.files.wordpress.com/2017/03/written_questions_and_answers_-_uk_parliament21.png?w=592&h=292
https://ouseful.files.wordpress.com/2017/03/filetype_xls_or_filetype_csv_or_filetype_xlsx__site_http___qna_files_parliament_uk_qna-attachments_-_google_search.png?w=555&h=351
https://ouseful.files.wordpress.com/2017/03/writtenquestions-attachment.png?w=578&h=298
https://ouseful.files.wordpress.com/2017/03/written_statements_-_uk_parliament.png?w=733&h=473
https://ouseful.files.wordpress.com/2017/03/deposited_papers_-_uk_parliament2.png?w=570&h=550
https://ouseful.files.wordpress.com/2017/03/depositedpapers_files__filetype_xls_or_filetype_csv_or_filetype_xlsx__-_google_search.png?w=541&h=428
https://ouseful.files.wordpress.com/2017/03/research_briefings.png?w=486&h=353
https://ouseful.files.wordpress.com/2017/03/superfast_broadband_coverage_in_the_uk_-_commons_library_briefing_-_uk_parliament.png?w=512&h=251
https://ouseful.files.wordpress.com/2017/03/briefingnaturalid.png?w=643&h=180
https://ouseful.files.wordpress.com/2017/03/briefingjson.png?w=554&h=171
https://ouseful.files.wordpress.com/2017/03/nearmisses.png?w=830&h=204
https://ouseful.files.wordpress.com/2013/12/other-clusters.png?w=579&h=257
https://ouseful.files.wordpress.com/2017/03/lords_amendments_-_abortion__disability_equality__bill__hl__committee_stage_-_decisions__all_-_member__all.png?w=608&h=767
https://ouseful.files.wordpress.com/2017/03/view-source_https___www_theguardian_com_uk.png?w=593&h=294
https://ouseful.files.wordpress.com/2017/03/developer_and_view.png?w=569&h=340
https://ouseful.files.wordpress.com/2017/03/written_questions_and_answers_-_uk_parliament.png?w=546&h=665
https://ouseful.files.wordpress.com/2017/03/writtenquestions.png?w=699&h=257
https://ouseful.files.wordpress.com/2017/03/writtenquestions21.png?w=568&h=512
https://ouseful.files.wordpress.com/2017/03/writtenquestions5.png?w=622&h=235
https://ouseful.files.wordpress.com/2017/03/writtenquestions3.png?w=713&h=295
https://ouseful.files.wordpress.com/2017/03/writtenquestions4.png?w=679&h=443
https://ouseful.files.wordpress.com/2017/03/oli_hawkins__internal_migration_flows_in_the_uk.png?w=555&h=416
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html.png?w=508&h=571
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html2.png?w=511&h=468
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html3.png?w=464&h=456
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html4.png?w=490&h=455
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html5.png?w=474&h=436
https://ouseful.files.wordpress.com/2017/03/migrantflow.png?w=504&h=380
https://ouseful.files.wordpress.com/2017/03/migrantflow2.png?w=469&h=438
https://i2.wp.com/i.creativecommons.org/l/by/3.0/88x31.png
https://i0.wp.com/www.google.com/images/icons/ui/gprofile_button-16.png
https://i0.wp.com/www.feedburner.com/fb/images/pub/feed-icon32x32.png
https://s-ssl.wordpress.com/wp-includes/images/rss.png?m=1354137473h
https://s-ssl.wordpress.com/wp-includes/images/rss.png?m=1354137473h
https://sb.scorecardresearch.com/p?c1=2&c2=7518284&c3=&c4=&c5=&c6=&c15=&cv=2.0&cj=1
https://pixel.wp.com/b.gif?v=noscript

Zdjęcia bez atrybutu ALT

https://ouseful.files.wordpress.com/2017/03/filetype_xlsx_site_parliament_uk_-_google_search.png?w=629&h=409
https://ouseful.files.wordpress.com/2017/03/filetype_xls_or_filetype_csv_or_filetype_xlsx__site_https___www_parliament_uk_documents_foi__-_google_search.png?w=535&h=446
https://ouseful.files.wordpress.com/2017/03/written_questions_and_answers_-_uk_parliament21.png?w=592&h=292
https://ouseful.files.wordpress.com/2017/03/filetype_xls_or_filetype_csv_or_filetype_xlsx__site_http___qna_files_parliament_uk_qna-attachments_-_google_search.png?w=555&h=351
https://ouseful.files.wordpress.com/2017/03/writtenquestions-attachment.png?w=578&h=298
https://ouseful.files.wordpress.com/2017/03/written_statements_-_uk_parliament.png?w=733&h=473
https://ouseful.files.wordpress.com/2017/03/deposited_papers_-_uk_parliament2.png?w=570&h=550
https://ouseful.files.wordpress.com/2017/03/depositedpapers_files__filetype_xls_or_filetype_csv_or_filetype_xlsx__-_google_search.png?w=541&h=428
https://ouseful.files.wordpress.com/2017/03/research_briefings.png?w=486&h=353
https://ouseful.files.wordpress.com/2017/03/superfast_broadband_coverage_in_the_uk_-_commons_library_briefing_-_uk_parliament.png?w=512&h=251
https://ouseful.files.wordpress.com/2017/03/briefingnaturalid.png?w=643&h=180
https://ouseful.files.wordpress.com/2017/03/briefingjson.png?w=554&h=171
https://ouseful.files.wordpress.com/2017/03/nearmisses.png?w=830&h=204
https://ouseful.files.wordpress.com/2013/12/other-clusters.png?w=579&h=257
https://ouseful.files.wordpress.com/2017/03/lords_amendments_-_abortion__disability_equality__bill__hl__committee_stage_-_decisions__all_-_member__all.png?w=608&h=767
https://ouseful.files.wordpress.com/2017/03/view-source_https___www_theguardian_com_uk.png?w=593&h=294
https://ouseful.files.wordpress.com/2017/03/developer_and_view.png?w=569&h=340
https://ouseful.files.wordpress.com/2017/03/written_questions_and_answers_-_uk_parliament.png?w=546&h=665
https://ouseful.files.wordpress.com/2017/03/writtenquestions.png?w=699&h=257
https://ouseful.files.wordpress.com/2017/03/writtenquestions21.png?w=568&h=512
https://ouseful.files.wordpress.com/2017/03/writtenquestions5.png?w=622&h=235
https://ouseful.files.wordpress.com/2017/03/writtenquestions3.png?w=713&h=295
https://ouseful.files.wordpress.com/2017/03/writtenquestions4.png?w=679&h=443
https://ouseful.files.wordpress.com/2017/03/oli_hawkins__internal_migration_flows_in_the_uk.png?w=555&h=416
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html.png?w=508&h=571
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html2.png?w=511&h=468
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html3.png?w=464&h=456
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html4.png?w=490&h=455
https://ouseful.files.wordpress.com/2017/03/dropbox_parlidata_rdemos_sankeyflow_html5.png?w=474&h=436
https://ouseful.files.wordpress.com/2017/03/migrantflow.png?w=504&h=380
https://ouseful.files.wordpress.com/2017/03/migrantflow2.png?w=469&h=438
https://i0.wp.com/www.google.com/images/icons/ui/gprofile_button-16.png
https://i0.wp.com/www.feedburner.com/fb/images/pub/feed-icon32x32.png
https://sb.scorecardresearch.com/p?c1=2&c2=7518284&c3=&c4=&c5=&c6=&c15=&cv=2.0&cj=1
https://pixel.wp.com/b.gif?v=noscript

Ranking:


Alexa Traffic
Daily Global Rank Trend
Daily Reach (Percent)









Majestic SEO











Text on page:

ouseful.info, the blog...trying to find useful things to do with emerging technologies in open education and data journalism about editorial policy search march 23, 2017 tracking down data files associated with parliamentary business one of the ways of finding data related files scattered around an organisations website is to run a web search using a search limit that specifies a data-y filetype, such as xlsx  for an excel spreadsheet (csv and xls are also good candidates). for example, on the parliament website, we could run a query along the lines of filetype:xlsx site:parliament.uk and then opt to display the omitted results: taken together, these files form an ad hoc datastore (e.g. as per this demo on using foi response on whatdotheyknow as an “as if” open datastore). looking at the urls, we see that data containing files are strewn about the online parliamentary estate (that is, the website;-)… freedom of information related datasets parliament seems to be quite open in the way is handles its foi responses, publishing disclosure logs and releasing datafile attachments rooted on https://www.parliament.uk/documents/foi/: written questions responses to written questions often come with datafile attachments. these are files are posted to the subdomain http://qna.files.parliament.uk/qna-attachments. given the numeric key for a particular question, we can run a query on the written answers api to find details about the attachment: looking at the actual url , something like http://qna.files.parliament.uk/qna-attachments/454264/original/28152%20-%20table.xlsx, it looks as if some guesswork is required generating the url from the data contained in the api response? (for example, how might original attachments might distinguish from other attachments (such as “revised” ones, maybe?).) written statements written statements often come with one of more data file attachments. the data files also appear on the http://qna.files.parliament.uk/ subdomain although it looks like they’re on a different path to the answered question attachments (http://qna.files.parliament.uk/ws-attachments compared to http://qna.files.parliament.uk/qna-attachments). this subdomain doesn’t appear to have the data files indexed and searchable on google? i don’t see a written statements api on http://explore.data.parliament.uk/ either? deposited papers deposited papers often include supporting documents, including spreadsheets. files are located under http://data.parliament.uk/depositedpapers/files/: at the current time there is no api search over deposited papers. committee papers a range of documents may be associated with committees, including reports, responses to reports, and correspondence, as well as evidence submissions. these appear to mainly be pdf documents. written evidence documents are rooted on http://data.parliament.uk/writtenevidence/committeeevidence.svc/evidencedocument/ and can be found from committee written evidence web (html) pages rooted on the same path (example). a web search for site:parliament.uk inurl:committee (filetype:xls or filetype:csv or filetype:xlsx) doesn’t turn up any results. parliamentary research briefings research briefings are published by commons and lords libraries, and may include additional documents. briefings may be published along with supporting documents, including spreadsheets: the files are published under the following subdomain and path:  http://researchbriefings.files.parliament.uk/. the file attachments urls can be found via the research briefings api. this response is a cut down result – the full resource description, including links to supplementary items, can be found by keying on the numeric identifier from the uri _about which the “naturally” identified resource (e.g. sn06643) is described. summary data files can be found variously around the parliamentary website, including down the following paths: https://www.parliament.uk/documents/foi/ http://qna.files.parliament.uk/qna-attachments (appear in written answers api results); http://qna.files.parliament.uk/ws-attachments http://data.parliament.uk/depositedpapers/files/ http://researchbriefings.files.parliament.uk/ (appear in research briefings api results) (i don’t think the api supports querying resources that specifically include attachments in general, or attachments of a particular filetype?) what would be nice would be support for discovering some of these resources. a quick way in to this would be the ability to limit search query responses to webpages that link to a data file, on the grounds that the linking web page probably contains some of the keywords that you’re likely to be searching for data around? written by tony hirst leave a comment posted in anything you want tagged with parlidata march 20, 2017 data cleaning – finding near matches in names in the post what nationality did you say you were, again? i showed how we could use the fuzzyset python library to try to reconcile user supplied nationalities entered via a free text entry form to “preferred” nationalities listed in the fco register of country names. here’s another quick example of how to use fuzzyset to help clean a list of names, possibly repeated, that may include near misses or partial matches. import pandas as pd names=['s. smith', 'john smith','james brown','john brown','t. smith','john brown'] df=pd.dataframe({'name':names}) # set the thresh value (0..1) to tweak match strength thresh=0.8 import fuzzyset names=df['name'].tolist() cleaner = fuzzyset.fuzzyset() collisions=[] for name in names: maybe=cleaner.get(name) # if there is a possible match, get a list of tuples back: (score, matchstring) # the following line finds the highest match score m=max(maybe,key=lambda item:item[0]) if maybe is not none else (-1,'') # if there is no match, or the max match score is below the threshold value, if not maybe or m[0] < thresh: # assume that it's not a match and add name to list of "clean" names… cleaner.add(name) elif m[0] >= thresh: # but if there is a possible match, print a warning txt='assuming {} is a match with {} ({}) so not adding'.format(name,m[1],m[0]) print(txt) # and add the name to a collisions list collisions.append((name,m)) #now generate a simple report print('------\n\n# cleaning report:\n\n## match set:\n{}\n\n------\n\n## collisions:\n{}'.format(cleaner.exact_set, collisions)) the report looks something like this: sometimes, you may want to be alerted to exact matches; for example, if you are expecting the values to all be unique. however, at other times, you may be happy to ignore duplicates, in which case you might consider dropping them from the names list. one way to do this is to convert the names list to a set, and back again, names=list(set(names)), although this changes the list order. alternatively, from the pandas dataframe column, just take unique values: names=df['name'].unique().tolist(). you may also want to know how many times duplicate (exact matches) occur. in such a case, we can list items that appear at list twice in the names list using the pandas dataframe value_counts() method: #get a count of the number of times each value occurs in a column, along with the value vals=df['name'].value_counts() #select items where the value count is greater than one vals[vals > 1] #john brown 2 ps another way of detecting, and correcting, near matches is to use an application such as openrefine, in particular its clustering tool: written by tony hirst 2 comments posted in anything you want march 19, 2017 talking to developers and civic hackers on their own terms… looking at the (new to me) lords amendments  website yesterday, i wondered whether the search was being fed by an api call, or whether an api is available elsewhere to the underlying data. (an api is available – find it via explore.data.parliament.uk.) there are a couple of ways of doing this. one way is to view source” (in chrome, view -> developer -> view source), because as everybody *should know, you can inspect the code running in your browser; another is to use developer tools, (from the same browser menu) to look at the browser network activity and see what urls are called when a new selection is made on the web page (the data has to come from somewhere right? and again, you can look at this if you want to.) anyway, it struck me that most folk don’t tend to use these tools, but those who do are probably interested in something you’re doing – either how the page was constructed to give a particular effect, or where the data is coming from. if you’re building screenscraper’s, you’d typically look to the source too. so if you’re trying to engage with developers, whey not leave them messages where they’re likely to look. for example, if you want to promote an api, or perhaps if you’re recruiting. which reminded me that the guardian used to have an open developer recruitment ad running in their webpage source. indeed, they still do: so if your page is api powered somewhere along the line, and you want to promote the api, why not pop a message at the top of the page source? or, as i learned from james bridle (he of the new aesthetic; you do follow that photoblog, right?), one of the most thought provoking artists around at the moment (i hesitate to say “digital artist” because that’s still an artist, right… erm…. (data) journalism… erm…  hypocrite…), why not use the console too? james even provides a script to help…. welcome.js. ps for a recent example of james’ work, which also invokes the idea of magic-related computing metaphors (cf. here, for example), see this recent interview: meet the artist using ritual magic to trap self-driving cars. pps this has got me wondering whether we could actually deliver a “just below the surface” uncourse or training through html source, console messages and javascript comments. documented code with a view to teaching how to get the most out of an api, or how to do webdesign. the medium as the educational message. see also: search engine powered courses… written by tony hirst leave a comment posted in anything you want march 18, 2017 tinkering with parliament data apis: commons written questions and parliamentary written answers so…. inspired by @philbgorman, i had a quick play last night with parliament written questions data, putting together a recipe (output) for plotting a sankey diagram showing the flow of questions from members of the house of commons by party to various answering bodies for a particular parliamentary session. the response that comes back from the written questions api includes a question uin (unique identification number?). if you faff around with date settings on the parliamentary questions web page you can search for a question by this id: here’s an example of the response from a build download of questions (by 2015/16 session) from the commons written questions api, deep filtered by the uin: if you tweak the _about uri, which i think refers to details about the question, you get the following sort of response, built around a numeric identifier (447753 in this case): there’s no statement of the actual answer text in that response, although there is a reference to an answer resource, again keyed by the same numeric key: the numeric key from the _about identifier is also used with both the commons written questions api and the parliamentary questions answered api. for example, questions: and answers: the uin values can’t be used with either of these apis, though? ps i know, i know, the idea is that we just follow resource links (but they’re broken, right? the leading lda. is missing from the http identifiers), but sometimes it’s just as easy to take a unique fragment of the uri (like the numeric key) and then just drop it into the appropriate context when you want it. in this case, contexts are http://lda.data.parliament.uk/resources/{}.json http://lda.data.parliament.uk/commonswrittenquestions/{}.json http://lda.data.parliament.uk/resources/{}/answer.json http://lda.data.parliament.uk/answeredquestions/{}.json imho, any way… ;-) pps for a full list of apis, see explore.data.parliament.uk written by tony hirst leave a comment posted in anything you want tagged with parlidata march 17, 2017 experimenting with sankey diagrams in r and python a couple of days ago, i spotted a post by oli hawkins on visualising migration between the countries of the uk which linked to a sankey diagram demo of internal migration flows in the uk. one of the things that interests me about the jupyter and rstudio centred reproducible research ecosystems is their support for libraries that generate interactive html/javascript outputs (charts, maps, etc) from a computational data analysis context such as r, or python/pandas, so it was only natural (?!) that i though i should see how easy it would be to generate something similar from a code context. in an r context, there are several libraries available that support the generation of sankey diagrams, including googlevis (which wraps google chart tools), and a couple of packages that wrap d3.js – an original rcharts sankey diagram demo by @timelyporfolio, and a more recent htmlwidgets demo (sankeyd3). here’s an example of the evolution of my sankey diagram in r using googlevis – the rmd code is here and a version of the knitred html output is here. the original data comprised a matrix relating population flows between english regions, wales, scotland and northern ireland. the simplest rendering of the data using the googleviz sankey diagram generator produces an output that uses default colours to label the nodes. using the country code indicator at the start of each region/country identifier, we can generate a mapping from country to a country colour that can then be used to identify the country associated with each node. one of the settings for the diagram allows the source (or target) node colour to determine the edge colour. we can also play with the values we use as node labels: if we exclude edges relating to flow between regions of the same country, we get a diagram that is more reminiscent of oli’s orignal (country level) demo. note also that the charts that are generated are interactive – in this case, we see a popup that describes the flow along one particular edge. if we associate a country with each region, we can group the data and sum the flow values to produce country level flows. charting this produces a chart similar to the original inspiration. as well as providing the code for generating each of the above sankey diagrams, the rmd file linked above also includes demonstrations for generating basic sankey diagrams for the original dataset using the rcharts and htmlwidgets r libraries. in order to provide a point of comparison, i also generated a python/pandas workflow using jupyter notebooks and the ipysankey widget. (in fact, i generated the full workflow through the different chart versions first in pandas – i find it an easier language to think in than r! – and then used that workflow as a crib for the r version…) the original notebook is here and an example of the html version of it here. note that i tried to save a rasterisation of the widgets but they don’t seem to have turned out that well… the original (default) diagram looks like this: and the final version, after a bit of data wrangling, looks like this: once again, all the code is provided in the notebook. one of the nice things about all these packages is that they produce outputs than can be reused/embedded elsewhere, or that can be used as a first automatically produced draft of code that can be tweaked by hand. i’ll have more to say about that in a future post… written by tony hirst 2 comments posted in anything you want, rstats tagged with jupyter, parlidata march 15, 2017 grouping numbers that are nearly the same – casual clustering a couple of reasons for tinkering with wrc rally data this year, over and the above the obvious of wanting to find a way to engage with motorsport at a data level, specifically, i wanted a context for thinking a bit more about ways of generating (commentary) text from timing data, as well as a “safe” environment in which i could look for ways of identifying features (or storypoints) in the data that might provide a basis for making interesting text comments. one way in to finding features is to look at a visual representations of the data (that is, just look at charts) and see what jumps out… if anything does, then you can ponder ways of automating the detection or recognition of those visually compelling features, or things that correspond to them, or proxy for them, in some way. i’ll give an example of that in the next post in this series, but for now, let’s consider the following question:how can we group numbers that are nearly the same? for example, if i have a set of stage split times, how can i identify groups of drivers that have recorded exactly, or even just nearly, the same time? via stackoverflow, i found the following handy fragment: def cluster(data, maxgap): '''arrange data into groups where successive elements differ by no more than *maxgap* cluster([1, 6, 9, 100, 102, 105, 109, 134, 139], maxgap=10) [[1, 6, 9], [100, 102, 105, 109], [134, 139]] cluster([1, 6, 9, 99, 100, 102, 105, 134, 139, 141], maxgap=10) [[1, 6, 9], [99, 100, 102, 105], [134, 139, 141]] ''' data.sort() groups = [[data[0]]] for x in data[1:]: if abs(x - groups[-1][-1]) <= maxgap: groups[-1].append(x) else: groups.append([x]) return groups print(cluster([2.1,7.4,3.9,4.6,2.5,2.4,2.52],0.35)) [[2.1, 2.4, 2.5, 2.52], [3.9], [4.6], [7.4]] it struck me that a tweak to the code could limit the range of any grouping relative to a maximum distance between the first and the last number in any particular grouping – maybe i don’t want a group to have a range more than 0.41 for example (that is, strictly more than a dodgy floating point 0.4…): def cluster2(data, maxgap, maxrange=none): data.sort() groups = [[data[0]]] for x in data[1:]: inmaxrange = true if maxrange is none else abs(x-groups[-1][0]) <=maxrange if abs(x - groups[-1][-1]) <= maxgap and inmaxrange: groups[-1].append(x) groups[-1].append(x) else: groups.append([x]) return groups print(cluster2([2.1,7.4,3.9,4.6,2.5,2.4,2.52],0.35,0.41)) [[2.1, 2.4, 2.5], [2.52], [3.9], [4.6], [7.4]] a downside of this is we might argue we have mistakenly omitted a number that is very close to the last number in the previous group, when we should rightfully have included it, because it’s not really very far away from a number that is close to the group range threshold value… in which case, we might pull back numbers into a group that are really close to the current last member in the group irrespective of whether we past the originally specified group range: def cluster3(data, maxgap, maxrange=none, maxminrange=none): data.sort() groups = [[data[0]]] for x in data[1:]: inmaxrange = true if maxrange is none else abs(x-groups[-1][0])<=maxrange inmaxminrange = false if maxminrange is none else abs(x-groups[-1][-1])<=maxminrange if (abs(x - groups[-1][-1]) <= maxgap and inmaxrange) or inmaxminrange: groups[-1].append(x) else: groups.append([x]) return groups print(cluster3([2.1,7.4,3.9,4.6,2.5,2.4,2.52],0.35,0.41,0.25)) [[2.1, 2.4, 2.5, 2.52], [3.9], [4.6], [7.4]] with these simple fragments, i can now find groups of times that are reasonably close to each other. i can also look for times that are close to other times: trythis = [x for x in cluster3([2.1,7.4,3.9,4.6,2.5,2.4,2.52],0.35,0.41,0.25) if 2.4 in x] trythis[0] if len(trythis) else '' [2.1, 2.4, 2.5, 2.52] ps i think the following vectorised pandas fragments assign group numbers to rows based on the near matches of numerics in a specified column: def numclustergroup(x,col,maxgap): x=x.sort_values(col) x['cluster'] = (x[col].diff()>=maxgap).cumsum() return x def numclustergroup2(x,col,maxgap,maxrange): x=x.sort_values(col) x['cluster'] = (x[col].diff()>=maxgap).cumsum() x['cdiff']=x.groupby('cluster')[col].diff() x['cluster'] = ((x.groupby('cluster')['cdiff'].cumsum()>maxrange) | (x[col].diff()>=maxgap)).cumsum() return x.drop('cdiff',1) def numclustergroup3(x,col,maxgap,maxrange,maxminrange): x=x.sort_values(col) x['cluster'] = (x[col].diff()>=maxgap).cumsum() x['cdiff']=x.groupby('cluster')[col].diff() x['cluster'] = (((x.groupby('cluster')['cdiff'].cumsum()>maxrange) | (x[col].diff()>=maxgap)) & (x[col].diff()>maxminrange) ).cumsum() return x.drop('cdiff',1) #test uu=pd.dataframe({'x':list(range(0,8)),'y':[1.3,2.1,7.4,3.9,4.6,2.5,2.4,2.52]}) numclustergroup(uu,'y',0.35) numclustergroup2(uu,'y',0.35,0.41) numclustergroup3(uu,'y',0.35,0.41,0.25) the basic idea is to generate logical tests that evaluate as true whenever you want to increase the group number. written by tony hirst 3 comments posted in anything you want, data, tinkering tagged with data, wrc march 15, 2017 transparency in parliament… and in data journalism? over the weekend, i picked up a copy of parliament ltd, a two hundred and fifty page rant (or should that be diatribe?!) against various mps and lords and their registered (and unregistered) interests. [disclosure: i’ve picked up a few days paid work for the parliamentary digital service this year.] the book draws on data scraped from the parliament website (presumably), as well as companies house (via a collaboration – or business arrangement? i wasn’t quite sure..?! – with duedil). as anyone who’s tried looking at registers of interests on the parliament website, you’ll know they’re not published in the friendliest of formats, and the data is not made available as a machine readable downloadable dataset. sources of “interests” data from parliament by the by, the registers maintained on the parliament website include: register of members’ financial interests (a scraper and csv bulk download(ish!) is available for this information from maxharlow/scrape-members-financial-interests; theyworkforyou also provide members’ interests data); register of lords’ interests; register of interests of members’ secretaries and research assistants; register of journalists’ interests. there’s also the register of all-party groups, which includes statements of benefits received by groups from third parties (links to old scrapers here, possibly? another place we might look for associations between mps/lords and companies, or other organisations, is in hansard. for example, evan odell recently published a dataset on hansard speeches and sentiment that “provides information on each speech of ten words or longer, made in the house of commons between 1980 and 2016, with information on the speaking mp, their party, gender and age at the time of the speech”. the r code is supplied, so we could presumably use that as a basis for running the transcripts through a named entity extractor to try to pull out the names of companies or organisation mentioned by each speaker (perhaps as well as something that looks out for declarations of interest mentioned whilst speaking?). it might also be interesting to try to match sentiment with organisation mentions?! where companies are mentioned in a debate, and the debate leads to a division (that is, a vote), we can then use sources such as the public whip to download information scraped from the parliament website about who voted how on which division, and perhaps look for mps voting against their party line but in favour of a particular interest. (if you know other sources of scraper code, or apis offered over scraped versions of any of the above registers, please let me know via the comments and i’ll add them in. also any registers i’ve missed…) others sources of data relating to members’ parliamentary and government activities by the by, the appg post also led me to another old post on scraping ministers’ meetings. for an idea of the sorts of thing currently disclosed (at a departmental level?), see e.g. cabinet office: ministers’ transparency publications). there are possibly other forms of declaration on other government department websites? in relation to lobbying firms, there is the office of the registrar of consultant lobbyists. also outside parliament, the electoral commission provide information about donations and loans to individuals (including mps) and candidate spending and donations at elections. other sources of information about members’ external interests companies house can also be used to look up whether a particular named individual is or has been listed as a company officer (such as a director), or is a person of significant control (psc, sometimes referred to as a “beneficial owner”) of a particular company. whilst the psc register is currently available as a bulk data download, the director information isn’t (at least, not without making a personal request). it can be accessed in a piecemeal fashion via the companies house api though. current and recently disqualified directors can be found via the insolvency service or the companies house api. the insolvency service also publish information about individual insolvency (that is, bankruptcies). where individuals are associated with an organisation and are registered as a data controller, they should also be listed as an entry on the ico data protection register. evan’s github account also hosts a fork of a repo published by the ncvo for import[ing] data from the charity commission data extract, data that presumably lists trustees, and again that can be used as the basis for finding associations between individuals and organisations. at a local level, local councils hold a variety of public registers, detailing for example the names of individuals licensed to sell alcohol, or to act as operators of betting, sex or animal breeding establishments. the cqc publish data listing the names of individuals in charge of operating care homes. nhs england list names of gps working at particular practices. and so on… more generally, the advisory committee on business appointments (acoba) has details of appointments taken up by former ministers. (acoba also report on appointments taken up by former crown servants.) so what? so that’s all so much data, and as martin williams points out in his book, it can take a lot of effort to pull the data into some sort of shape where you can use it. and with data sourced from various places, there may be issues associated with sharing the data on once you have processed it. to a certain extent, you might argue that parliament is blocking “transparency” around members’ interests – and possible conflicts of interest – by publishing the data in a way that makes it difficult to process it as data without having to do a fair amount of work prepping the data. but i’m not sure how true that is. journalists are, in part, competitive beasts, wanting to be the first to a story. if a data is well presented and comes with analysis scripts that identify story features and story points, essentially generating a press release around a dataset without much effort involved, there’s nothing there to find (nothing “hidden” in the data waiting for the intrepid journalist to reveal it). but when the data is messy and takes some effort to clean up, then the chances that anyone else will just stumble across the story point by chance are reduced. and when the data is “secret” but still publicly accessible, all the better. for example, it’s no surprise that a common request of alvateli (the platform underpinning foi request site whatdotheyknow) was from journalists wanting to be able to hide, or at least embargo, their requests, and (data) responses provided to them (h/t chris adams for that observation and link). another question that arises around journalists who do clean datasets and then analyse them but who don’t then share their working, (the data cleaning and analysis scripts), is the extent to which they are themselves complicit in acting against transparency. why should we believe the journalists’ accusations or explanations without seeing what they are actually based on? (maybe in cleaning the dataset, they threw away explicit declarations of interest because they were too messy to process which then skewed the conclusions drawn from the data analysis?) by sharing analyses, you also provide others with the opportunity to spot errors in your working, or maybe even improve them (scary for some; but consider the alternative: you produce an analysis script that contains an error, and maybe reuse it, generating claims that are false and that cannot be supported by the data. publishing those is not in your interest.) there also seems to be the implicit assumption that competitors are trying to steal your stories rather than find your own. they probably think and say the same about you. but who has the time to spend it all trying to crib over other people’s shoulders? (other than me of course;-)) on the other hand, there may be some commercial or competitive intelligence advantage in having a cleaned dataset that you can work with efficiently that is not available to other journalists or that you believe may hide further stories. (a similar argument to the latter is often made by academic researchers who do not want to share their research data, lest someone else makes a discovery from it that eluded them.) but then, with a first mover advantage, you should be able to work with your data and link it to other data sets faster than your competitors. and if they are sharing data back too, then you may be able to benefit from their cleaned data and analysis scripts. everyone gets to improve their game. another possible form of “competitive” advantage that comes from not publishing cleaned datasets or scripts is that is doesn’t tip the hand of the journalist and reveal investigative “trade secrets” to the subject or subjects of an investigation. for by revealing how a story was identified from a dataset, subjects may change their behaviour so as not to divulge information into the dataset in the same revealing way in the future. one final considerations: when it comes to news stories, what is the extent to which part-time tinkerers and civic tech hackers such as myself spoil a possible story by doing a halfway hack on a dataset, bringing small scale attention to it, and as a consequence disabling or polluting it as a source of journalistic novelty/story-worthiness? does anyone have examples of where a possible open data story was not pursued by the press because a local data geek blogger got there first? written by tony hirst leave a comment posted in anything you want tagged with ddj older posts © aj hirst 2008-2016 attribution: tony hirst. search for: contact email me (tony hirst) follow @psychemedia bookmarks presentations email subscription enter your email address to subscribe to this blog and receive notifications of new posts by email. join 1,766 other followers subscribe in a reader my other blogs f1datajunkie blog f1 data tinkerings digital worlds blog game design uncourse visual gadgets blog visualisation bits'n'pieces digital worlds blogaugmented reality and autonomous vehicles – enabled by the same technologies?using cameras to capture objects as well as imagesinterlude – ginger facial rigging modelrecap – enabling the impossiblethe photorealistic effect…custom search engines churnalism times - polls (search recent polls/surveys) churnalism times (search press releases) coursedetective uk university degree course prospectuses uk university libraries infoskills resources ouseful web properties search how do i? instructional video metasearch engine page hacks rss for the content of this page view posts in chronological order @psychemedia tweets the 2020s will be the decade of "oops.... erm.... now don't panic, but i think i just..." twitter.com/alberto_cottic… 8 hours ago when making pizza becomes np complete because everyone has different "don't likes" but no-one can be arsed to actually contribute.. 8 hours ago poking around a web search on / (filetype:ipynb or filetype:rmd) site:ac.uk / (notable absence close to home….) 15 hours ago follow @psychemedia tumbling…"so while the broadcasters (unlike the press) may have passed the test of impartiality during the...""finding the story in 150 million rows of data""to live entirely in public is a form of solitary confinement."icts and anti-corruption: theory and examples | tim's blog"instead of getting more context for decisions, we would get less; instead of seeing the logic...""bbc r&d is now winding down the current uas activity and this conference marked a key stage in...""the vc/ipo money does however distort the market, look at amazon’s ‘profit’...""newsreader will process news in 4 different languages when it comes in. it will extract what..."governance | the openspending blog"the reality of news media is that once the documents are posted online, they lose a lot of value. a..." recent posts tracking down data files associated with parliamentary business data cleaning – finding near matches in names talking to developers and civic hackers on their own terms… tinkering with parliament data apis: commons written questions and parliamentary written answers experimenting with sankey diagrams in r and python top posts seven graphical interfaces to docker using google spreadsheets like a database - the query formula seven ways of running ipython / jupyter notebooks displaying events from multiple google calendars in a single embedded calendar view experimenting with sankey diagrams in r and python creating your own results charts for surveys created with google forms merging datasets with common columns in google refine updating google calendars from a google spreadsheet archives archives select month march 2017 (11) february 2017 (5) january 2017 (19) december 2016 (12) november 2016 (8) october 2016 (9) september 2016 (11) july 2016 (10) june 2016 (7) may 2016 (21) april 2016 (16) march 2016 (22) february 2016 (10) january 2016 (12) december 2015 (12) november 2015 (6) october 2015 (10) september 2015 (13) august 2015 (10) july 2015 (12) june 2015 (17) may 2015 (8) april 2015 (13) march 2015 (11) february 2015 (13) january 2015 (18) december 2014 (9) november 2014 (5) october 2014 (9) september 2014 (14) august 2014 (5) july 2014 (12) june 2014 (7) may 2014 (4) april 2014 (11) march 2014 (5) february 2014 (11) january 2014 (5) december 2013 (6) november 2013 (10) october 2013 (6) september 2013 (8) august 2013 (7) july 2013 (3) june 2013 (8) may 2013 (17) april 2013 (18) march 2013 (10) february 2013 (14) january 2013 (22) december 2012 (12) november 2012 (23) october 2012 (9) september 2012 (15) august 2012 (16) july 2012 (18) june 2012 (5) may 2012 (22) april 2012 (23) march 2012 (20) february 2012 (19) january 2012 (23) december 2011 (19) november 2011 (18) october 2011 (20) september 2011 (20) august 2011 (18) july 2011 (23) june 2011 (24) may 2011 (17) april 2011 (17) march 2011 (16) february 2011 (14) january 2011 (20) december 2010 (24) november 2010 (23) october 2010 (26) september 2010 (27) august 2010 (11) july 2010 (35) june 2010 (17) may 2010 (23) april 2010 (22) march 2010 (37) february 2010 (26) january 2010 (17) december 2009 (10) november 2009 (14) october 2009 (20) september 2009 (20) august 2009 (19) july 2009 (18) june 2009 (19) may 2009 (13) april 2009 (17) march 2009 (24) february 2009 (18) january 2009 (30) december 2008 (25) november 2008 (26) october 2008 (23) september 2008 (15) august 2008 (20) july 2008 (5) blog at wordpress.com. ouseful.info, the blog… blog at wordpress.com. post to cancel


Here you find all texts from your page as Google (googlebot) and others search engines seen it.

Words density analysis:

Numbers of all words: 5492

One word

Two words phrases

Three words phrases

the - 7.17% (394)
data - 2.18% (120)
and - 2% (110)
for - 1.53% (84)
that - 1.42% (78)
you - 1.02% (56)
parliament - 0.89% (49)
with - 0.87% (48)
group - 0.86% (47)
max - 0.76% (42)
are - 0.75% (41)
use - 0.71% (39)
here - 0.69% (38)
ten - 0.67% (37)
from - 0.66% (36)
can - 0.66% (36)
not - 0.66% (36)
file - 0.66% (36)
name - 0.58% (32)
out - 0.58% (32)
list - 0.56% (31)
one - 0.56% (31)
his - 0.55% (30)
may - 0.55% (30)
age - 0.53% (29)
range - 0.53% (29)
this - 0.53% (29)
search - 0.51% (28)
per - 0.51% (28)
set - 0.49% (27)
api - 0.49% (27)
written - 0.47% (26)
all - 0.47% (26)
other - 0.46% (25)
also - 0.44% (24)
groups - 0.44% (24)
source - 0.42% (23)
form - 0.42% (23)
try - 0.4% (22)
files - 0.4% (22)
key - 0.4% (22)
question - 0.4% (22)
act - 0.4% (22)
look - 0.4% (22)
way - 0.38% (21)
web - 0.38% (21)
own - 0.38% (21)
any - 0.38% (21)
post - 0.38% (21)
want - 0.36% (20)
http - 0.36% (20)
time - 0.36% (20)
example - 0.36% (20)
interest - 0.36% (20)
thing - 0.36% (20)
names - 0.35% (19)
site - 0.35% (19)
- 0.35% (19)
some - 0.35% (19)
how - 0.33% (18)
they - 0.33% (18)
match - 0.33% (18)
but - 0.33% (18)
there - 0.31% (17)
value - 0.31% (17)
maxgap - 0.31% (17)
get - 0.31% (17)
about - 0.29% (16)
data. - 0.29% (16)
count - 0.29% (16)
march - 0.29% (16)
clean - 0.29% (16)
able - 0.29% (16)
register - 0.29% (16)
attachments - 0.27% (15)
questions - 0.27% (15)
find - 0.27% (15)
sum - 0.27% (15)
see - 0.27% (15)
now - 0.25% (14)
work - 0.25% (14)
maxrange - 0.25% (14)
blog - 0.25% (14)
which - 0.25% (14)
diagram - 0.25% (14)
2016 - 0.24% (13)
their - 0.24% (13)
dataset - 0.24% (13)
2015 - 0.24% (13)
sankey - 0.24% (13)
let - 0.22% (12)
page - 0.22% (12)
2014 - 0.22% (12)
times - 0.22% (12)
follow - 0.22% (12)
parliamentary - 0.22% (12)
interests - 0.22% (12)
where - 0.22% (12)
have - 0.22% (12)
2010 - 0.22% (12)
then - 0.22% (12)
2013 - 0.22% (12)
answer - 0.22% (12)
over - 0.22% (12)
response - 0.22% (12)
2009 - 0.22% (12)
2011 - 0.22% (12)
publish - 0.22% (12)
else - 0.22% (12)
2012 - 0.22% (12)
like - 0.2% (11)
down - 0.2% (11)
comment - 0.2% (11)
google - 0.2% (11)
using - 0.2% (11)
script - 0.2% (11)
flow - 0.2% (11)
number - 0.2% (11)
code - 0.2% (11)
particular - 0.2% (11)
text - 0.2% (11)
same - 0.18% (10)
country - 0.18% (10)
research - 0.18% (10)
hirst - 0.18% (10)
sources - 0.18% (10)
what - 0.18% (10)
them - 0.18% (10)
2.4 - 0.18% (10)
around - 0.18% (10)
your - 0.18% (10)
story - 0.18% (10)
information - 0.18% (10)
2017 - 0.18% (10)
more - 0.18% (10)
common - 0.18% (10)
website - 0.18% (10)
chart - 0.18% (10)
come - 0.18% (10)
december - 0.16% (9)
again - 0.16% (9)
january - 0.16% (9)
october - 0.16% (9)
september - 0.16% (9)
member - 0.16% (9)
july - 0.16% (9)
sort - 0.16% (9)
data, - 0.16% (9)
2.4, - 0.16% (9)
just - 0.16% (9)
journalist - 0.16% (9)
maybe - 0.16% (9)
february - 0.16% (9)
link - 0.16% (9)
documents - 0.16% (9)
know - 0.16% (9)
point - 0.16% (9)
include - 0.16% (9)
tony - 0.16% (9)
each - 0.16% (9)
than - 0.16% (9)
provide - 0.16% (9)
when - 0.16% (9)
2.52] - 0.16% (9)
original - 0.16% (9)
might - 0.16% (9)
november - 0.16% (9)
used - 0.16% (9)
posted - 0.16% (9)
resource - 0.16% (9)
near - 0.15% (8)
def - 0.15% (8)
take - 0.15% (8)
should - 0.15% (8)
members - 0.15% (8)
[2.1, - 0.15% (8)
new - 0.15% (8)
lose - 0.15% (8)
generate - 0.15% (8)
values - 0.15% (8)
2.5, - 0.15% (8)
too - 0.15% (8)
anything - 0.15% (8)
well - 0.15% (8)
turn - 0.15% (8)
commons - 0.15% (8)
via - 0.15% (8)
april - 0.15% (8)
june - 0.15% (8)
these - 0.15% (8)
august - 0.15% (8)
such - 0.15% (8)
example, - 0.15% (8)
lists - 0.13% (7)
were - 0.13% (7)
2.52], - 0.13% (7)
html - 0.13% (7)
view - 0.13% (7)
recent - 0.13% (7)
including - 0.13% (7)
(18) - 0.13% (7)
individual - 0.13% (7)
possible - 0.13% (7)
is, - 0.13% (7)
following - 0.13% (7)
close - 0.13% (7)
(17) - 0.13% (7)
none - 0.13% (7)
run - 0.13% (7)
maxminrange - 0.13% (7)
briefings - 0.13% (7)
companies - 0.13% (7)
who - 0.13% (7)
add - 0.13% (7)
repo - 0.13% (7)
pandas - 0.13% (7)
(23) - 0.13% (7)
open - 0.13% (7)
between - 0.13% (7)
oli - 0.13% (7)
matches - 0.13% (7)
support - 0.13% (7)
python - 0.13% (7)
9], - 0.13% (7)
context - 0.13% (7)
2008 - 0.13% (7)
numeric - 0.13% (7)
associate - 0.13% (7)
think - 0.13% (7)
was - 0.13% (7)
(10) - 0.13% (7)
book - 0.13% (7)
(20) - 0.13% (7)
though - 0.13% (7)
another - 0.13% (7)
available - 0.13% (7)
(5) - 0.13% (7)
has - 0.11% (6)
don’t - 0.11% (6)
produce - 0.11% (6)
appear - 0.11% (6)
developer - 0.11% (6)
organisation - 0.11% (6)
charts - 0.11% (6)
comments - 0.11% (6)
because - 0.11% (6)
even - 0.11% (6)
generating - 0.11% (6)
comes - 0.11% (6)
ways - 0.11% (6)
analysis - 0.11% (6)
diagrams - 0.11% (6)
associated - 0.11% (6)
does - 0.11% (6)
version - 0.11% (6)
abs(x - 0.11% (6)
house - 0.11% (6)
looks - 0.11% (6)
demo - 0.11% (6)
return - 0.11% (6)
0.41 - 0.11% (6)
very - 0.11% (6)
course - 0.11% (6)
press - 0.11% (6)
url - 0.11% (6)
journalists - 0.11% (6)
papers - 0.11% (6)
note - 0.11% (6)
result - 0.11% (6)
crib - 0.11% (6)
print - 0.11% (6)
members’ - 0.11% (6)
line - 0.11% (6)
report - 0.11% (6)
thresh - 0.11% (6)
first - 0.11% (6)
here’s - 0.11% (6)
old - 0.11% (6)
found - 0.11% (6)
published - 0.11% (6)
could - 0.11% (6)
case - 0.11% (6)
(11) - 0.11% (6)
evidence - 0.11% (6)
committee - 0.11% (6)
(12) - 0.11% (6)
current - 0.11% (6)
finding - 0.11% (6)
hand - 0.09% (5)
identifier - 0.09% (5)
tinkering - 0.09% (5)
(that - 0.09% (5)
individuals - 0.09% (5)
tagged - 0.09% (5)
responses - 0.09% (5)
foi - 0.09% (5)
public - 0.09% (5)
lords - 0.09% (5)
answers - 0.09% (5)
actual - 0.09% (5)
scripts - 0.09% (5)
leave - 0.09% (5)
apis - 0.09% (5)
you’re - 0.09% (5)
differ - 0.09% (5)
resources - 0.09% (5)
hack - 0.09% (5)
libraries - 0.09% (5)
fuzzyset - 0.09% (5)
query - 0.09% (5)
will - 0.09% (5)
would - 0.09% (5)
into - 0.09% (5)
(19) - 0.09% (5)
visual - 0.09% (5)
output - 0.09% (5)
along - 0.09% (5)
sets - 0.09% (5)
spreadsheet - 0.09% (5)
statement - 0.09% (5)
cleaning - 0.09% (5)
posts - 0.09% (5)
download - 0.09% (5)
gets - 0.09% (5)
x['cluster'] - 0.09% (5)
results - 0.09% (5)
whether - 0.09% (5)
).cumsum() - 0.09% (5)
back - 0.09% (5)
level - 0.09% (5)
mps - 0.09% (5)
unique - 0.09% (5)
rows - 0.09% (5)
deposited - 0.09% (5)
registers - 0.09% (5)
something - 0.09% (5)
set, - 0.09% (5)
collisions - 0.09% (5)
they’re - 0.07% (4)
path - 0.07% (4)
running - 0.07% (4)
drop - 0.07% (4)
fragment - 0.07% (4)
groups[-1].append(x) - 0.07% (4)
groups[-1][-1]) - 0.07% (4)
couple - 0.07% (4)
numbers - 0.07% (4)
process - 0.07% (4)
http://qna.files.parliament.uk/qna-attachments - 0.07% (4)
(9) - 0.07% (4)
email - 0.07% (4)
things - 0.07% (4)
different - 0.07% (4)
consider - 0.07% (4)
taken - 0.07% (4)
trying - 0.07% (4)
(22) - 0.07% (4)
102, - 0.07% (4)
above - 0.07% (4)
now, - 0.07% (4)
digital - 0.07% (4)
(13) - 0.07% (4)
brown - 0.07% (4)
notebook - 0.07% (4)
100, - 0.07% (4)
jupyter - 0.07% (4)
(8) - 0.07% (4)
dataframe - 0.07% (4)
subdomain - 0.07% (4)
134, - 0.07% (4)
news - 0.07% (4)
various - 0.07% (4)
say - 0.07% (4)
transparency - 0.07% (4)
request - 0.07% (4)
party - 0.07% (4)
api, - 0.07% (4)
tweak - 0.07% (4)
inmaxrange - 0.07% (4)
artist - 0.07% (4)
true - 0.07% (4)
publishing - 0.07% (4)
idea - 0.07% (4)
under - 0.07% (4)
(14) - 0.07% (4)
reveal - 0.07% (4)
cleaner - 0.07% (4)
colour - 0.07% (4)
without - 0.07% (4)
media - 0.07% (4)
james - 0.07% (4)
useful - 0.07% (4)
identify - 0.07% (4)
message - 0.07% (4)
ago - 0.07% (4)
m[0] - 0.07% (4)
node - 0.07% (4)
sure - 0.07% (4)
maxgap, - 0.07% (4)
case, - 0.07% (4)
statements - 0.07% (4)
uri - 0.07% (4)
business - 0.07% (4)
looking - 0.07% (4)
exact - 0.07% (4)
often - 0.07% (4)
play - 0.07% (4)
features - 0.07% (4)
last - 0.07% (4)
its - 0.07% (4)
datasets - 0.07% (4)
scraper - 0.07% (4)
made - 0.07% (4)
full - 0.07% (4)
grouping - 0.05% (3)
office - 0.05% (3)
rally - 0.05% (3)
basis - 0.05% (3)
making - 0.05% (3)
bit - 0.05% (3)
nearly - 0.05% (3)
dataset, - 0.05% (3)
e.g. - 0.05% (3)
i’ll - 0.05% (3)
extent - 0.05% (3)
once - 0.05% (3)
wanting - 0.05% (3)
competitive - 0.05% (3)
led - 0.05% (3)
anyone - 0.05% (3)
presumably - 0.05% (3)
points - 0.05% (3)
against - 0.05% (3)
registered - 0.05% (3)
service - 0.05% (3)
scraped - 0.05% (3)
director - 0.05% (3)
seem - 0.05% (3)
effort - 0.05% (3)
appointments - 0.05% (3)
csv - 0.05% (3)
working - 0.05% (3)
speech - 0.05% (3)
insolvency - 0.05% (3)
hold - 0.05% (3)
local - 0.05% (3)
lot - 0.05% (3)
sharing - 0.05% (3)
105, - 0.05% (3)
[3.9], - 0.05% (3)
in. - 0.05% (3)
data.sort() - 0.05% (3)
[[data[0]]] - 0.05% (3)
data[1:]: - 0.05% (3)
else: - 0.05% (3)
groups.append([x]) - 0.05% (3)
[[2.1, - 0.05% (3)
[4.6], - 0.05% (3)
(x[col].diff()>=maxgap).cumsum() - 0.05% (3)
[7.4]] - 0.05% (3)
it, - 0.05% (3)
pull - 0.05% (3)
range: - 0.05% (3)
trythis - 0.05% (3)
mentioned - 0.05% (3)
x=x.sort_values(col) - 0.05% (3)
;-) - 0.05% (3)
right? - 0.05% (3)
widgets - 0.05% (3)
listed - 0.05% (3)
test - 0.05% (3)
still - 0.05% (3)
hours - 0.05% (3)
why - 0.05% (3)
pop - 0.05% (3)
smith', - 0.05% (3)
ouseful - 0.05% (3)
import - 0.05% (3)
possibly - 0.05% (3)
did - 0.05% (3)
'john - 0.05% (3)
parlidata - 0.05% (3)
here, - 0.05% (3)
probably - 0.05% (3)
quick - 0.05% (3)
(7) - 0.05% (3)
_about - 0.05% (3)
actually - 0.05% (3)
links - 0.05% (3)
through - 0.05% (3)
api. - 0.05% (3)
perhaps - 0.05% (3)
match, - 0.05% (3)
engine - 0.05% (3)
simple - 0.05% (3)
again, - 0.05% (3)
calendar - 0.05% (3)
developers - 0.05% (3)
civic - 0.05% (3)
hackers - 0.05% (3)
times, - 0.05% (3)
me) - 0.05% (3)
this: - 0.05% (3)
spreadsheets - 0.05% (3)
doing - 0.05% (3)
(in - 0.05% (3)
score - 0.05% (3)
know, - 0.05% (3)
select - 0.05% (3)
browser - 0.05% (3)
(the - 0.05% (3)
declaration - 0.05% (3)
extract - 0.05% (3)
most - 0.05% (3)
those - 0.05% (3)
either - 0.05% (3)
give - 0.05% (3)
items - 0.05% (3)
urls - 0.05% (3)
(16) - 0.05% (3)
(or - 0.05% (3)
(6) - 0.05% (3)
(24) - 0.05% (3)
rmd - 0.05% (3)
website, - 0.05% (3)
relating - 0.05% (3)
limit - 0.05% (3)
cleaned - 0.05% (3)
advantage - 0.05% (3)
edge - 0.05% (3)
subject - 0.05% (3)
organisations - 0.05% (3)
spend - 0.05% (3)
generated - 0.05% (3)
order - 0.05% (3)
related - 0.05% (3)
stories - 0.05% (3)
workflow - 0.05% (3)
(26) - 0.05% (3)
journalism - 0.05% (3)
attachments. - 0.05% (3)
rooted - 0.05% (3)
similar - 0.05% (3)
explore.data.parliament.uk - 0.05% (3)
includes - 0.05% (3)
uin - 0.05% (3)
date - 0.05% (3)
@psychemedia - 0.05% (3)
there’s - 0.05% (3)
doesn’t - 0.05% (3)
sometimes - 0.05% (3)
it’s - 0.05% (3)
answered - 0.05% (3)
although - 0.05% (3)
it. - 0.05% (3)
experimenting - 0.05% (3)
tech - 0.05% (3)
flows - 0.05% (3)
details - 0.05% (3)
spending - 0.04% (2)
refine - 0.04% (2)
company - 0.04% (2)
control - 0.04% (2)
commission - 0.04% (2)
(15) - 0.04% (2)
referred - 0.04% (2)
donations - 0.04% (2)
candidate - 0.04% (2)
on… - 0.04% (2)
psc - 0.04% (2)
archives - 0.04% (2)
person - 0.04% (2)
former - 0.04% (2)
merging - 0.04% (2)
however - 0.04% (2)
(acoba - 0.04% (2)
benefit - 0.04% (2)
messy - 0.04% (2)
up, - 0.04% (2)
chance - 0.04% (2)
change - 0.04% (2)
identified - 0.04% (2)
revealing - 0.04% (2)
subjects - 0.04% (2)
tip - 0.04% (2)
everyone - 0.04% (2)
least - 0.04% (2)
analyse - 0.04% (2)
presentations - 0.04% (2)
share - 0.04% (2)
working, - 0.04% (2)
lest - 0.04% (2)
hide - 0.04% (2)
believe - 0.04% (2)
seeing - 0.04% (2)
(maybe - 0.04% (2)
competitors - 0.04% (2)
reuse - 0.04% (2)
improve - 0.04% (2)
examples - 0.04% (2)
enter - 0.04% (2)
much - 0.04% (2)
release - 0.04% (2)
surveys - 0.04% (2)
embedded - 0.04% (2)
calendars - 0.04% (2)
makes - 0.04% (2)
having - 0.04% (2)
seven - 0.04% (2)
is. - 0.04% (2)
spot - 0.04% (2)
instead - 0.04% (2)
live - 0.04% (2)
don't - 0.04% (2)
subscribe - 0.04% (2)
university - 0.04% (2)
nothing - 0.04% (2)
(search - 0.04% (2)
polls - 0.04% (2)
churnalism - 0.04% (2)
reality - 0.04% (2)
design - 0.04% (2)
game - 0.04% (2)
worlds - 0.04% (2)
reader - 0.04% (2)
receive - 0.04% (2)
department - 0.04% (2)
ouseful.info, - 0.04% (2)
forms - 0.04% (2)
struck - 0.04% (2)
top - 0.04% (2)
line, - 0.04% (2)
powered - 0.04% (2)
webpage - 0.04% (2)
promote - 0.04% (2)
messages - 0.04% (2)
engage - 0.04% (2)
somewhere - 0.04% (2)
that’s - 0.04% (2)
activity - 0.04% (2)
tools, - 0.04% (2)
(an - 0.04% (2)
elsewhere - 0.04% (2)
terms… - 0.04% (2)
talking - 0.04% (2)
clustering - 0.04% (2)
value_counts() - 0.04% (2)
or, - 0.04% (2)
(data) - 0.04% (2)
column, - 0.04% (2)
comments. - 0.04% (2)
days - 0.04% (2)
easy - 0.04% (2)
apis, - 0.04% (2)
response, - 0.04% (2)
build - 0.04% (2)
settings - 0.04% (2)
together - 0.04% (2)
apis: - 0.04% (2)
javascript - 0.04% (2)
erm… - 0.04% (2)
source, - 0.04% (2)
uncourse - 0.04% (2)
got - 0.04% (2)
pps - 0.04% (2)
magic - 0.04% (2)
meet - 0.04% (2)
provides - 0.04% (2)
console - 0.04% (2)
duplicate - 0.04% (2)
thresh: - 0.04% (2)
linked - 0.04% (2)
seems - 0.04% (2)
supporting - 0.04% (2)
(such - 0.04% (2)
question, - 0.04% (2)
datafile - 0.04% (2)
logs - 0.04% (2)
disclosure - 0.04% (2)
quite - 0.04% (2)
online - 0.04% (2)
reports, - 0.04% (2)
whatdotheyknow - 0.04% (2)
(e.g. - 0.04% (2)
datastore - 0.04% (2)
omitted - 0.04% (2)
display - 0.04% (2)
tracking - 0.04% (2)
education - 0.04% (2)
technologies - 0.04% (2)
documents, - 0.04% (2)
documents. - 0.04% (2)
threshold - 0.04% (2)
contains - 0.04% (2)
below - 0.04% (2)
partial - 0.04% (2)
help - 0.04% (2)
entry - 0.04% (2)
free - 0.04% (2)
nationalities - 0.04% (2)
supplied - 0.04% (2)
likely - 0.04% (2)
nice - 0.04% (2)
pages - 0.04% (2)
specifically - 0.04% (2)
results) - 0.04% (2)
http://researchbriefings.files.parliament.uk/ - 0.04% (2)
http://data.parliament.uk/depositedpapers/files/ - 0.04% (2)
http://qna.files.parliament.uk/ws-attachments - 0.04% (2)
(appear - 0.04% (2)
https://www.parliament.uk/documents/foi/ - 0.04% (2)
site:parliament.uk - 0.04% (2)
migration - 0.04% (2)
uk. - 0.04% (2)
(at - 0.04% (2)
basic - 0.04% (2)
financial - 0.04% (2)
by, - 0.04% (2)
i’ve - 0.04% (2)
interests. - 0.04% (2)
two - 0.04% (2)
picked - 0.04% (2)
logical - 0.04% (2)
(x[col].diff()>=maxgap)) - 0.04% (2)
interests; - 0.04% (2)
x.drop('cdiff',1) - 0.04% (2)
((x.groupby('cluster')['cdiff'].cumsum()>maxrange) - 0.04% (2)
x['cdiff']=x.groupby('cluster')[col].diff() - 0.04% (2)
based - 0.04% (2)
fragments - 0.04% (2)
cluster3([2.1,7.4,3.9,4.6,2.5,2.4,2.52],0.35,0.41,0.25) - 0.04% (2)
false - 0.04% (2)
inmaxminrange - 0.04% (2)
bulk - 0.04% (2)
journalists’ - 0.04% (2)
away - 0.04% (2)
whilst - 0.04% (2)
currently - 0.04% (2)
ministers’ - 0.04% (2)
government - 0.04% (2)
others - 0.04% (2)
registers, - 0.04% (2)
interest. - 0.04% (2)
division - 0.04% (2)
debate - 0.04% (2)
declarations - 0.04% (2)
place - 0.04% (2)
named - 0.04% (2)
speaking - 0.04% (2)
words - 0.04% (2)
sentiment - 0.04% (2)
hansard - 0.04% (2)
recently - 0.04% (2)
evan - 0.04% (2)
associations - 0.04% (2)
specified - 0.04% (2)
really - 0.04% (2)
interactive - 0.04% (2)
produces - 0.04% (2)
language - 0.04% (2)
versions - 0.04% (2)
notebooks - 0.04% (2)
python/pandas - 0.04% (2)
regions - 0.04% (2)
label - 0.04% (2)
default - 0.04% (2)
uses - 0.04% (2)
here. - 0.04% (2)
final - 0.04% (2)
htmlwidgets - 0.04% (2)
rcharts - 0.04% (2)
wrap - 0.04% (2)
packages - 0.04% (2)
googlevis - 0.04% (2)
diagrams, - 0.04% (2)
natural - 0.04% (2)
outputs - 0.04% (2)
tried - 0.04% (2)
provided - 0.04% (2)
argue - 0.04% (2)
cluster([1, - 0.04% (2)
<=maxrange - 0.04% (2)
abs(x-groups[-1][0]) - 0.04% (2)
''' - 0.04% (2)
139, - 0.04% (2)
99, - 0.04% (2)
[134, - 0.04% (2)
[[1, - 0.04% (2)
maxgap=10) - 0.04% (2)
maxgap): - 0.04% (2)
future - 0.04% (2)
stage - 0.04% (2)
them, - 0.04% (2)
correspond - 0.04% (2)
interesting - 0.04% (2)
level, - 0.04% (2)
wrc - 0.04% (2)
15, - 0.04% (2)
want, - 0.04% (2)
wordpress.com. - 0.04% (2)
of the - 0.53% (29)
the data - 0.4% (22)
that i - 0.33% (18)
in the - 0.31% (17)
on the - 0.29% (16)
to the - 0.27% (15)
at the - 0.25% (14)
from the - 0.25% (14)
that a - 0.24% (13)
you want - 0.22% (12)
that is - 0.2% (11)
and the - 0.2% (11)
can be - 0.2% (11)
sankey diagram - 0.2% (11)
a data - 0.18% (10)
the same - 0.18% (10)
if you - 0.16% (9)
tony hirst - 0.16% (9)
or the - 0.16% (9)
the parliament - 0.16% (9)
by the - 0.15% (8)
written questions - 0.15% (8)
for example, - 0.15% (8)
posted in - 0.13% (7)
written by - 0.13% (7)
a particular - 0.13% (7)
for the - 0.13% (7)
anything you - 0.13% (7)
such as - 0.13% (7)
by tony - 0.13% (7)
want to - 0.13% (7)
that are - 0.13% (7)
data file - 0.13% (7)
the following - 0.13% (7)
we can - 0.11% (6)
one of - 0.11% (6)
from a - 0.11% (6)
register of - 0.11% (6)
the names - 0.11% (6)
the original - 0.11% (6)
as well - 0.11% (6)
sankey diagrams - 0.11% (6)
there is - 0.11% (6)
example of - 0.11% (6)
well as - 0.11% (6)
you can - 0.11% (6)
and an - 0.11% (6)
associated with - 0.11% (6)
close to - 0.11% (6)
may be - 0.11% (6)
ways of - 0.11% (6)
data is - 0.09% (5)
to find - 0.09% (5)
tagged with - 0.09% (5)
out the - 0.09% (5)
look at - 0.09% (5)
that can - 0.09% (5)
data files - 0.09% (5)
with parliament - 0.09% (5)
be found - 0.09% (5)
around a - 0.09% (5)
(that is, - 0.09% (5)
be used - 0.09% (5)
an api - 0.09% (5)
to have - 0.07% (4)
with a - 0.07% (4)
we could - 0.07% (4)
written answers - 0.07% (4)
a couple - 0.07% (4)
as the - 0.07% (4)
use the - 0.07% (4)
near matches - 0.07% (4)
the parliamentary - 0.07% (4)
a possible - 0.07% (4)
you may - 0.07% (4)
the code - 0.07% (4)
that the - 0.07% (4)
about the - 0.07% (4)
sources of - 0.07% (4)
in this - 0.07% (4)
commons written - 0.07% (4)
and then - 0.07% (4)
[2.1, 2.4, - 0.07% (4)
couple of - 0.07% (4)
with the - 0.07% (4)
is not - 0.07% (4)
to look - 0.07% (4)
to use - 0.07% (4)
trying to - 0.07% (4)
able to - 0.07% (4)
list of - 0.07% (4)
try to - 0.07% (4)
look for - 0.07% (4)
looking at - 0.07% (4)
2014 (5) - 0.07% (4)
none else - 0.07% (4)
in data - 0.07% (4)
files are - 0.07% (4)
via the - 0.07% (4)
research briefings - 0.07% (4)
100, 102, - 0.07% (4)
an example - 0.07% (4)
a dataset - 0.07% (4)
would be - 0.07% (4)
be the - 0.07% (4)
file attachments - 0.07% (4)
names of - 0.07% (4)
hirst leave - 0.07% (4)
a comment - 0.07% (4)
is that - 0.07% (4)
who do - 0.07% (4)
is none - 0.05% (3)
[[2.1, 2.4, - 0.05% (3)
[3.9], [4.6], - 0.05% (3)
return groups - 0.05% (3)
which i - 0.05% (3)
maybe i - 0.05% (3)
groups = - 0.05% (3)
[[data[0]]] for - 0.05% (3)
also be - 0.05% (3)
2.5, 2.52] - 0.05% (3)
2.52], [3.9], - 0.05% (3)
here’s an - 0.05% (3)
else: groups.append([x]) - 0.05% (3)
groups.append([x]) return - 0.05% (3)
abs(x - - 0.05% (3)
as data - 0.05% (3)
groups[-1][-1]) <= - 0.05% (3)
all the - 0.05% (3)
[4.6], [7.4]] - 0.05% (3)
we might - 0.05% (3)
).cumsum() return - 0.05% (3)
be able - 0.05% (3)
or that - 0.05% (3)
2.4, 2.5, - 0.05% (3)
groups[-1].append(x) else: - 0.05% (3)
basis for - 0.05% (3)
the flow - 0.05% (3)
information about - 0.05% (3)
case, we - 0.05% (3)
way in - 0.05% (3)
the above - 0.05% (3)
used to - 0.05% (3)
of data - 0.05% (3)
have a - 0.05% (3)
more than - 0.05% (3)
is here - 0.05% (3)
wanting to - 0.05% (3)
using the - 0.05% (3)
– an - 0.05% (3)
return x - 0.05% (3)
like this: - 0.05% (3)
data.sort() groups - 0.05% (3)
= [[data[0]]] - 0.05% (3)
in data[1:]: - 0.05% (3)
diagrams in - 0.05% (3)
with sankey - 0.05% (3)
- groups[-1][-1]) - 0.05% (3)
the group - 0.05% (3)
x=x.sort_values(col) x['cluster'] - 0.05% (3)
tinkering with - 0.05% (3)
one way - 0.05% (3)
(9) september - 0.05% (3)
written statements - 0.05% (3)
a story - 0.05% (3)
2015 (13) - 0.05% (3)
web page - 0.05% (3)
where the - 0.05% (3)
a count - 0.05% (3)
(12) november - 0.05% (3)
and civic - 0.05% (3)
deposited papers - 0.05% (3)
in which - 0.05% (3)
data and - 0.05% (3)
the current - 0.05% (3)
example, if - 0.05% (3)
search for - 0.05% (3)
and python - 0.05% (3)
experimenting with - 0.05% (3)
if there - 0.05% (3)
how to - 0.05% (3)
to try - 0.05% (3)
the numeric - 0.05% (3)
hours ago - 0.05% (3)
want tagged - 0.05% (3)
is available - 0.05% (3)
looks like - 0.05% (3)
web search - 0.05% (3)
that you - 0.05% (3)
in your - 0.05% (3)
= (x[col].diff()>=maxgap).cumsum() - 0.05% (3)
rooted on - 0.05% (3)
2012 (23) - 0.05% (3)
data cleaning - 0.05% (3)
2011 (20) - 0.05% (3)
they are - 0.05% (3)
to other - 0.05% (3)
there are - 0.05% (3)
x['cdiff']=x.groupby('cluster')[col].diff() x['cluster'] - 0.04% (2)
google calendars - 0.04% (2)
2010 (23) - 0.04% (2)
the by, - 0.04% (2)
(17) march - 0.04% (2)
google spreadsheet - 0.04% (2)
2013 (10) - 0.04% (2)
2010 (17) - 0.04% (2)
statements of - 0.04% (2)
using google - 0.04% (2)
parliamentary written - 0.04% (2)
questions and - 0.04% (2)
2009 (20) - 0.04% (2)
data apis: - 0.04% (2)
their own - 0.04% (2)
2009 (18) - 0.04% (2)
hackers on - 0.04% (2)
age at - 0.04% (2)
data from - 0.04% (2)
available as - 0.04% (2)
(11) february - 0.04% (2)
| (x[col].diff()>=maxgap)) - 0.04% (2)
to generate - 0.04% (2)
(17) may - 0.04% (2)
15, 2017 - 0.04% (2)
2013 (8) - 0.04% (2)
2015 (12) - 0.04% (2)
lords and - 0.04% (2)
on data - 0.04% (2)
(14) january - 0.04% (2)
scraped from - 0.04% (2)
2016 (12) - 0.04% (2)
(7) may - 0.04% (2)
of interests - 0.04% (2)
return x.drop('cdiff',1) - 0.04% (2)
2011 (18) - 0.04% (2)
(20) september - 0.04% (2)
2016 (10) - 0.04% (2)
2011 (17) - 0.04% (2)
(11) july - 0.04% (2)
the time - 0.04% (2)
think i - 0.04% (2)
to developers - 0.04% (2)
by former - 0.04% (2)
that comes - 0.04% (2)
found via - 0.04% (2)
the insolvency - 0.04% (2)
form of - 0.04% (2)
with an - 0.04% (2)
listed as - 0.04% (2)
then you - 0.04% (2)
and again - 0.04% (2)
to act - 0.04% (2)
of individuals - 0.04% (2)
taken up - 0.04% (2)
and as - 0.04% (2)
into the - 0.04% (2)
you might - 0.04% (2)
cleaned dataset - 0.04% (2)
there may - 0.04% (2)
also provide - 0.04% (2)
to process - 0.04% (2)
to which - 0.04% (2)
the data. - 0.04% (2)
story point - 0.04% (2)
the extent - 0.04% (2)
and analysis - 0.04% (2)
share their - 0.04% (2)
but who - 0.04% (2)
house api - 0.04% (2)
the companies - 0.04% (2)
r code - 0.04% (2)
other sources - 0.04% (2)
use that - 0.04% (2)
instead of - 0.04% (2)
to pull - 0.04% (2)
the story - 0.04% (2)
8 hours - 0.04% (2)
declarations of - 0.04% (2)
to them - 0.04% (2)
of this - 0.04% (2)
uk university - 0.04% (2)
churnalism times - 0.04% (2)
can then - 0.04% (2)
digital worlds - 0.04% (2)
worlds blog - 0.04% (2)
when it - 0.04% (2)
of new - 0.04% (2)
to this - 0.04% (2)
any of - 0.04% (2)
follow @psychemedia - 0.04% (2)
because a - 0.04% (2)
for an - 0.04% (2)
the press - 0.04% (2)
where a - 0.04% (2)
a source - 0.04% (2)
whether a - 0.04% (2)
(such as - 0.04% (2)
a person - 0.04% (2)
ouseful.info, the - 0.04% (2)
jupyter notebooks - 0.04% (2)
times that - 0.04% (2)
the thresh - 0.04% (2)
name to - 0.04% (2)
and add - 0.04% (2)
a match - 0.04% (2)
thresh: # - 0.04% (2)
below the - 0.04% (2)
possible match, - 0.04% (2)
may include - 0.04% (2)
values to - 0.04% (2)
to help - 0.04% (2)
matches in - 0.04% (2)
finding near - 0.04% (2)
cleaning – - 0.04% (2)
with parlidata - 0.04% (2)
likely to - 0.04% (2)
add the - 0.04% (2)
times, you - 0.04% (2)
of these - 0.04% (2)
on their - 0.04% (2)
see what - 0.04% (2)
activity and - 0.04% (2)
way is - 0.04% (2)
find it - 0.04% (2)
api is - 0.04% (2)
own terms… - 0.04% (2)
civic hackers - 0.04% (2)
this is - 0.04% (2)
developers and - 0.04% (2)
talking to - 0.04% (2)
want march - 0.04% (2)
2 comments - 0.04% (2)
the pandas - 0.04% (2)
list to - 0.04% (2)
some of - 0.04% (2)
support for - 0.04% (2)
struck me - 0.04% (2)
the way - 0.04% (2)
the actual - 0.04% (2)
details about - 0.04% (2)
answers api - 0.04% (2)
the written - 0.04% (2)
are posted - 0.04% (2)
datafile attachments - 0.04% (2)
seems to - 0.04% (2)
the url - 0.04% (2)
along the - 0.04% (2)
a query - 0.04% (2)
parliament website, - 0.04% (2)
with parliamentary - 0.04% (2)
files associated - 0.04% (2)
down data - 0.04% (2)
something like - 0.04% (2)
often come - 0.04% (2)
think the - 0.04% (2)
and may - 0.04% (2)
api results) - 0.04% (2)
(appear in - 0.04% (2)
the uri - 0.04% (2)
links to - 0.04% (2)
– the - 0.04% (2)
are published - 0.04% (2)
published by - 0.04% (2)
attachments. the - 0.04% (2)
documents are - 0.04% (2)
written evidence - 0.04% (2)
a range - 0.04% (2)
supporting documents, - 0.04% (2)
don’t see - 0.04% (2)
appear to - 0.04% (2)
the web - 0.04% (2)
give a - 0.04% (2)
groups of - 0.04% (2)
march 15, - 0.04% (2)
a basis - 0.04% (2)
data that - 0.04% (2)
engage with - 0.04% (2)
way to - 0.04% (2)
are nearly - 0.04% (2)
numbers that - 0.04% (2)
to say - 0.04% (2)
maxgap=10) [[1, - 0.04% (2)
a first - 0.04% (2)
used as - 0.04% (2)
out that - 0.04% (2)
– and - 0.04% (2)
data journalism - 0.04% (2)
provide a - 0.04% (2)
things that - 0.04% (2)
6, 9], - 0.04% (2)
sankey diagrams, - 0.04% (2)
= true - 0.04% (2)
whether we - 0.04% (2)
a number - 0.04% (2)
number that - 0.04% (2)
maxgap and - 0.04% (2)
else abs(x-groups[-1][0]) - 0.04% (2)
if maxrange - 0.04% (2)
data[1:]: inmaxrange - 0.04% (2)
cluster([1, 6, - 0.04% (2)
a group - 0.04% (2)
number in - 0.04% (2)
the last - 0.04% (2)
of any - 0.04% (2)
2.5, 2.52], - 0.04% (2)
if abs(x - 0.04% (2)
the rmd - 0.04% (2)
for generating - 0.04% (2)
to engage - 0.04% (2)
parliament data - 0.04% (2)
a question - 0.04% (2)
of commons - 0.04% (2)
the house - 0.04% (2)
a quick - 0.04% (2)
and parliamentary - 0.04% (2)
apis: commons - 0.04% (2)
an api, - 0.04% (2)
numeric identifier - 0.04% (2)
the most - 0.04% (2)
idea of - 0.04% (2)
ps for - 0.04% (2)
why not - 0.04% (2)
to promote - 0.04% (2)
api, or - 0.04% (2)
sort of - 0.04% (2)
there’s no - 0.04% (2)
produces a - 0.04% (2)
data analysis - 0.04% (2)
a country - 0.04% (2)
relating to - 0.04% (2)
use as - 0.04% (2)
with each - 0.04% (2)
version of - 0.04% (2)
though i - 0.04% (2)
me about - 0.04% (2)
the commons - 0.04% (2)
the uk - 0.04% (2)
between the - 0.04% (2)
this case, - 0.04% (2)
that we - 0.04% (2)
idea is - 0.04% (2)
parliamentary questions - 0.04% (2)
blog at - 0.04% (2)
in anything you - 0.13% (7)
written by tony - 0.13% (7)
posted in anything - 0.13% (7)
anything you want - 0.13% (7)
by tony hirst - 0.13% (7)
as well as - 0.11% (6)
can be found - 0.09% (5)
one of the - 0.09% (5)
a comment posted - 0.07% (4)
commons written questions - 0.07% (4)
leave a comment - 0.07% (4)
tony hirst leave - 0.07% (4)
hirst leave a - 0.07% (4)
comment posted in - 0.07% (4)
a couple of - 0.07% (4)
the data is - 0.07% (4)
an example of - 0.07% (4)
written questions api - 0.05% (3)
to try to - 0.05% (3)
experimenting with sankey - 0.05% (3)
diagrams in r - 0.05% (3)
[2.1, 2.4, 2.5, - 0.05% (3)
x['cluster'] = (x[col].diff()>=maxgap).cumsum() - 0.05% (3)
x=x.sort_values(col) x['cluster'] = - 0.05% (3)
groups.append([x]) return groups - 0.05% (3)
= [[data[0]]] for - 0.05% (3)
x in data[1:]: - 0.05% (3)
groups[-1].append(x) else: groups.append([x]) - 0.05% (3)
2.52], [3.9], [4.6], - 0.05% (3)
close to the - 0.05% (3)
data.sort() groups = - 0.05% (3)
[[data[0]]] for x - 0.05% (3)
the names of - 0.05% (3)
groups[-1][-1]) <= maxgap - 0.05% (3)
there is a - 0.05% (3)
of a particular - 0.05% (3)
you want tagged - 0.05% (3)
a web search - 0.05% (3)
for example, if - 0.05% (3)
be able to - 0.05% (3)
comments posted in - 0.05% (3)
if there is - 0.05% (3)
looking at the - 0.05% (3)
their own terms… - 0.04% (2)
inmaxrange = true - 0.04% (2)
civic hackers on - 0.04% (2)
to developers and - 0.04% (2)
x['cdiff']=x.groupby('cluster')[col].diff() x['cluster'] = - 0.04% (2)
if maxrange is - 0.04% (2)
a number that - 0.04% (2)
2.5, 2.52], [3.9], - 0.04% (2)
tinkering with parliament - 0.04% (2)
times that are - 0.04% (2)
data apis: commons - 0.04% (2)
digital worlds blog - 0.04% (2)
written questions and - 0.04% (2)
appointments taken up - 0.04% (2)
= (x[col].diff()>=maxgap).cumsum() x['cdiff']=x.groupby('cluster')[col].diff() - 0.04% (2)
from the parliament - 0.04% (2)
wanting to be - 0.04% (2)
when the data - 0.04% (2)
maxrange is none - 0.04% (2)
be used as - 0.04% (2)
the extent to - 0.04% (2)
the insolvency service - 0.04% (2)
companies house api - 0.04% (2)
declarations of interest - 0.04% (2)
up by former - 0.04% (2)
from the data - 0.04% (2)
there may be - 0.04% (2)
age at the - 0.04% (2)
the house of - 0.04% (2)
available as a - 0.04% (2)
scraped from the - 0.04% (2)
picked up a - 0.04% (2)
data files associated - 0.04% (2)
look at a - 0.04% (2)
= true if - 0.04% (2)
a possible match, - 0.04% (2)
an api, or - 0.04% (2)
struck me that - 0.04% (2)
if you want - 0.04% (2)
on their own - 0.04% (2)
and civic hackers - 0.04% (2)
talking to developers - 0.04% (2)
tony hirst 2 - 0.04% (2)
is to use - 0.04% (2)
times, you may - 0.04% (2)
a list of - 0.04% (2)
and parliamentary written - 0.04% (2)
finding near matches - 0.04% (2)
data cleaning – - 0.04% (2)
tagged with parlidata - 0.04% (2)
found via the - 0.04% (2)
there is no - 0.04% (2)
the data files - 0.04% (2)
often come with - 0.04% (2)
written answers api - 0.04% (2)
the parliament website, - 0.04% (2)
parliament data apis: - 0.04% (2)
a sankey diagram - 0.04% (2)
in data[1:]: inmaxrange - 0.04% (2)
to engage with - 0.04% (2)
last number in - 0.04% (2)
[[2.1, 2.4, 2.5, - 0.04% (2)
if abs(x - - 0.04% (2)
99, 100, 102, - 0.04% (2)
cluster([1, 6, 9, - 0.04% (2)
[[1, 6, 9], - 0.04% (2)
with parliamentary business - 0.04% (2)
a basis for - 0.04% (2)
in the data - 0.04% (2)
nearly the same - 0.04% (2)
house of commons - 0.04% (2)
numbers that are - 0.04% (2)
anything you want, - 0.04% (2)
can be used - 0.04% (2)
is here and - 0.04% (2)
here and a - 0.04% (2)
sankey diagram demo - 0.04% (2)
the commons written - 0.04% (2)
the parliamentary questions - 0.04% (2)
for a particular - 0.04% (2)
parliamentary written answers - 0.04% (2)

Here you can find chart of all your popular one, two and three word phrases. Google and others search engines means your page is about words you use frequently.

Copyright © 2015-2016 hupso.pl. All rights reserved. FB | +G | Twitter

Hupso.pl jest serwisem internetowym, w którym jednym kliknieciem możesz szybko i łatwo sprawdź stronę www pod kątem SEO. Oferujemy darmowe pozycjonowanie stron internetowych oraz wycena domen i stron internetowych. Prowadzimy ranking polskich stron internetowych oraz ranking stron alexa.