Four short links: 6 February 2020
Identifying Doctored Images, Demonstrating ML, Radioactive Data, and Search All Your Things
- Assembler — Google’s Jigsaw group is releasing a tool to spot faked/doctored images.
- Demonstrating Machine Learning with Starbursts — simple demonstration for kids that illustrates “learning” to perform a task without explicit programming to accomplish that task.
- Using “Radioactive Data” to Detect Whether a Data Set Was Used For Training (Facebook AI) — We call this new verification method “radioactive” data because it is analogous to the use of radioactive markers in medicine: drugs such as barium sulphate allow doctors to see certain conditions more clearly on computerized tomography (CT) scans or other X-ray exams. We introduce unique marks that are harmless and have no impact on the classification accuracy of models, but remain present through the learning process and are detectable with high confidence in a neural network. Our method provides a level of confidence (p-value) that a radioactive data set was used to train a particular model.
- ripgrep-all — ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.