以åã«åç¥ãããšãããæ€çŽ¢é åãšæ©æ¢°åŠç¿é åã§é¢çœãã£ãèšäºãããã§ç޹ä»ããŠãããŸãã ã¬ãããªéãæžãããã«æºã蟌ããããäžå®éæºãŸã£ããé ä¿¡ããã»ãããæžãåŽãšããŠãæ°ã楜ãªã®ã§æ«ããã®æ¹éã§è¡ã£ãŠã¿ãŸãã
Articles
Search
elastic/elasticsearch - Integrate ANN search #78473
elasticsearch 8.x ã§ã¯ Lucene 9.0 ããæäŸããã ANN(è¿äŒŒè¿åæ¢çŽ¢)æ©èœãæäŸãããäºå®ã以åããæäŸãããŠãã elasticsearch ã® exact k NN ã¯ãããã¯ã·ã§ã³ç°å¢äžã§ã¯äœ¿ããªãæ§èœã ã£ãããLucene ã® ANN æ€çŽ¢ã¯çµæãã©ããªããéåžžã«æ°ã«ãªããES ã«çµã¿èŸŒãŸããããšã§ãANN ã®çµæã«å¯ŸããŠããã£ã«ã¿ãªã³ã°ããã€ããªããæ€çŽ¢ãªã© ES ã®æ€çŽ¢ãšã³ãžã³ãšçµã¿åãããå©çšãæåŸ ã§ããã®ã§è¿œã£ãŠããããã
Solr ãåãã ANN ã®æäŸãæºåäžã
å人çã«ã¯ãANN ã®ããšã kNN ãšæžãã®ã¯çŽããããã®ã§ãããŠã»ããããANN ã®ããšã Neural Search ãšèªãã§ããŠãã?ããã®??ãšæã£ããããã
ãããå®è£ ãããã°ãVespaããå OSS ããšã³ã¿ãŒãã©ã€ãºã®è¿äŒŒè¿åæ¢çŽ¢ãšã³ãžã³ãã©ããªã£ãŠããã®ãã¯éåžžã«é¢çœãæªæ¥ã§ããã
Reddit ãæ€çŽ¢ API ãå·æ°ããã話ã ã·ã¹ãã çã«ã©ãå¬ãããããããŒã ç·šæãå€ãã£ãŠã10 幎éã§ã€ã³ãã©ãšã³ãžãã¢ãå Œæ¥ã§éçºããŠããç¶æ ãããæ€çŽ¢ãšã³ãžãã¢ãå°ä»»ã®ã¯ã©ã€ã¢ã³ããšã³ãžãã¢ããã«ã¿ã€ã ã§æ¹åããŠããäœå¶ã«å€ãã£ããªã©ãå€åããããŠé¢çœãã£ãã
ä»ãŸã§ã®reddit ã®æ€çŽ¢ã·ã¹ãã Blog èšäºã·ãªãŒãºãé¢çœããã ã£ãã®ã§èªããã
Search at ShopifyâRange in Data and Engineering is the Future
Relevant Search èè Doug Turnbull ããã® Shopify ã®æè¡ããã°ã
ããããã¯æèš³
æ€çŽ¢ããŒã ãã©ãæ©èœãããã«ã€ããŠã Shopify ã®æ€çŽ¢ããŒã ã¡ã³ããŒã¯ããšã³ãžãã¢ã§ãããŒã¿ãµã€ãšã³ãã£ã¹ãã®ã©ã¡ãã§ããªãã代ããã«ãã®äž¡è ã®ã¹ãã«ãåæã«æŽ»çšããªããšãããªãå Žé¢ãå€ã ãããã€ãŸããšãããè¯ãæ€çŽ¢ã®ããã®æææ±ºå®ã«ã¯ãData science, Engineering ã®äž¡æ¹ã®ã¹ãã«ã䜵ãæã€ããšãå¿ èŠã§ããã
çç±ãšããŠã¯ãã©ã¡ããã ãã®ã¹ãã«ã ãšéçŸå®çãªæææ±ºå®ããã§ãããè¯ãæææ±ºå®ãã§ããªãããã äŸãã°ãData Scientist ã®ãµã€ãã§ã¯ãéçŸå®ç㪠Data ETL ãã¢ãã«ã®ã€ã³ãã°ã¬ãŒã·ã§ã³ããµãŒãã³ã°ã¯èªåã®ä»äºã§ã¯ãªããšèšãã ãããšã³ãžãã¢ã«ãã¹ãŠãä»»ããŠããŸãã éã«ãšã³ãžãã¢ã®ãµã€ãã§ã¯ãã©ã³ãã³ã°æ¹åã®ããã«å¿ èŠãªåŒãåºããè¶³ããã¢ã€ãã¢ãåºãªãããã®ç¥èããªããšãããžã§ã¯ãã¯å€±æããŠãçãæªãè§£æ±ºæ¹æ³ãè¡ã£ãŠããŸããåªãããšã³ãžãã¢ãªã³ã°ãšã¯ãå¶çŽãèæ ®ããäžã§æåã®æææ±ºå®ãäžãããšãªã®ã§ãData Science é åã®å¶çŽãèæ ®ã§ããªãããšã«ã¯æ€çŽ¢æ¹åãããããšã¯é£ããã
Shipify ã§ã¯ãæ€çŽ¢ã®ããã«ãšã³ãžãã¢ãªã³ã°ãData Science ã®ç·åŒãããããšãªããæ€çŽ¢æ¹åã®ãããªãäž¡è ã®é åãæŽ»çšããããšã«ç©æ¥µçã ã Data Scientist, ãšã³ãžãã¢ã®ïŒã€ã®ããŒã ãããã®ã§ã¯ãªããæ€çŽ¢ããŒã ã«ã¯äžã€ã®ããŒã ãããããŸãããäºãã«ã©ã¡ããã®é å°ã奪ãåãæµå¯Ÿããé¢ä¿æ§ããªãã
Data Science ãš Enginnering ã®æªæ¥ãšããŠäž¡è ãèªç±ã«è¡ãæ¥ããããšãããæ€çŽ¢æ¹åã«ã€ãªãã
ãšããã¡ããã¡ãè¯ãããšæžããŠããŠæåããŸããã èªåãèããŠããããšãšå®å šã«äžç·ã§ããããã£ã±ããã®äžçã£ãŠè¯ããããšèªå·±ã®æ¹åæ§ã®æ±ºæã«èªèº«ãäžä¹ããããæèŠãããã
Doug ããã®èšäºãèªãã ã®ã¯åããŠãªã®ã§ãããæç« ãèªãŸããæç« ã§èªãã§ãŠæ¥œãã…! ç©æ¥µçã«ããããªèšäºãéå»ã«æžããŠãããŠããã®ã§ããããåŸã ã«èªãã§ããããã
SearchSage: Learning Search Query Representations at Pinterest
ISIR-eCom 2022 at WSDM workshop
sigir-ecom ã® WSDM çã®ãããªã¯ãŒã¯ã·ã§ãããsigir-ecom ãããã¯ã·ã¹ãã é¢ã«ç¹åããè²¢ç®ãèªããããŠããã®ã§ãæçš¿è«æãå ¬éãããã®ãéåžžã«æ¥œãã¿ãç¶ç¶çã«ãã²ãšãéå¬ããŠã»ãã
LINE music ã§ã® elasticsearch ã®è² è·è©Šéšããæ€çŽ¢æ©èœè¿œå ã«ãã£ãŠãè² è·ãéè«ããéã«ã©ã解決ãããã玹ä»ã8000 äžæ²èŠæš¡ã®æ€çŽ¢ãæ±ããã®ã¯æ¥œãããã
Elasticsearch Learning to Rank ãã©ã°ã€ã³ã®äœ¿ãæ¹ãšãã€ã³ã
ZOZO ã§ã® elasticsearch plugin ãå©çšãã Learning to Rank ã®è§£èª¬èšäºã詳现㫠Pros, Cons ã説æãããŠããŠãå匷ã«ãªã£ãã
DMM ã®æ€çŽ¢ã«æ©æ¢°åŠç¿ãå°å ¥ããŠãA/B ãã¹ãã§å§åããèãæ¹
çç±ãšããŠã¯ãåæã§è€éãªããšããããããã·ã³ãã«ãªãã®ããå§ããã©ã®ç¹åŸŽãå¹ãã®ãç¥èŠãæºããŠããæ¹ãè¯ããšå€æããããã§ãã
ãã®ç¹ããã¡ããã¡ãçŽ æŽãããã§ããã ããžã¿ã«ã³ã³ãã³ãã§åšåº«ã®æŠå¿µããªãã®ã§ããããéžãã§ããç¹ãªã©ãµãŒãã¹ç¹æ§ã«æ²¿ã£ãŠãããããé£ããããšãããã«åçŽãªãã¶ã€ã³ãåžžã«æèããŠããç¹ãéåžžã«ç§éžãããããžã§ãã®ã¯ãŒã¯ãããŒãšã³ãžã³ã®tektonã¯åããŠç¥ã£ãã
å ããªå¹æã§ãé©ãã»ã©ã®ææã«ç¹ãããŸããã€ã³ãã¯ããäžããããæœçã宿œã§ããããšã¯æ¥æ¬æå€§çŽã®ãã©ãããã©ãŒã ãæ±ã£ãŠãã DMM ãªãã§ã¯ã ãšæã£ãŠããŸãã
ããããŸãã課é¡è§£æ±ºã§ãã¡ã°ã倧äºãªã€ã³ãã¯ãã倧ããªåé¡ãè§£ãã¹ãã培åºããŠããŠãåãã
ãšãã®èšäºãèŠãŠãããèªåãæ€çŽ¢æ¹åé 匵ãããšå æ°ãããããèšäºã§ããã
EC æ€çŽ¢åºåæ ã®ç²ŸåºŠããã¯ãšãªæå³ã®æ©æ¢°åŠç¿ã§æ¹åãã話
æ°ã«ãªã£ãç¹ãšããŠã¯ããªãã©ã€ã³è©äŸ¡ã§ã¯ã©ãŠããœãŒã·ã³ã°ãè¡ã£ãéã«ãã¢ãã« Aã»B ã®æ¯èŒãè¡ã£ãŠããããã¢ãã«ãªãã®è©äŸ¡ãè¡ã£ãŠã©ããããçµæãããžãã£ãã«ãªãã®ãã¯èŠãŠã¿ããã£ãã äŒç€Ÿå ã§éçšãããŠããã¯ã©ãŠããœãŒã·ã³ã°ããããªããŠãå€§èŠæš¡ãµãŒãã¹ã®ç¹æš©ã§ããã æ¬²ãèšããªããAB ã§å®éçã«ã©ãããæ¹åãããã®ããç¥ãããã£ã…(å ¬éããŠãããšããã®ã»ããçããã)
æ å ±æ€çŽ¢ã»æ€çŽ¢æè¡ Advent Calendar 2021
å¢ãã§äœæãããæ å ±æ€çŽ¢ã»æ€çŽ¢æè¡ Advent Calendar 2021ãã§ã¯ã13 ä»¶ã®èšäºãæçš¿ããŠããã ããŸãããèªåã 2 ä»¶ã®èšäºãæçš¿ããã®ã§ãèå³ã®ããæ¹ã¯åŸ¡èЧãã ããã
Amazon ã®æ€çŽ¢ã«èå³ããã£ãã®ã§ãã·ã¹ãã ãµã€ãã»ã©ã³ãã³ã°ããžãã¯ã®è§£èª¬èšäºãæèš³ããŸããã
- Amazon ã e ã³ããŒã¹æ€çŽ¢ã Lucene ã«ãããã©ãã¹ã±ãŒã«ãããŠããã at Berlin Buzzwords 2019
- [æèš³] Daria Sorokina ããã«ããã Amazon æ€çŽ¢ã§ã®è£œåã®ã©ã³ãã³ã°ä»ãã®æ¥œãã at MLconf SF 2016
Machine Learning & Engineering
On-device one-shot learning for image classifiers with Classification-by-Retrieval
ããã€ã¹äžã§ã® one-shot åŠç¿ã®æŽ»çšèšäºããã¢ã¢ããªã®åºæ¥ãéåžžã«é«ããiPhone ã§äºåã«ç»åã®ã«ããŽãªããšã«åããã¢ã«ãã (åã«ããŽãªã¯å€ããŠã 4 æçšåºŠ!?)ãäœæããŠãããŠããããèªã¿èŸŒãã°ã¢ããªäžã§ç»åèªèã¢ãã«ãäœæã§ããŠãåããŠããŠé©ãã
ãã®åç»ãèŠãããšã ãšã誰ããèªèã¢ãã«ãããããã«äœæããŠæŽ»çšããæªæ¥ãèŠããŠããŸã£ã…
Redesigning Etsyâs Machine Learning Platform
Etsy ãå éšã®æ©æ¢°åŠç¿åºç€ãå·æ°ãã話ã2017 幎ã«å°èŠæš¡ãªããŒã¿ãµã€ãšã³ã¹ããŒã ãããžã¹ãã£ãã¯ååž°ã®ã¢ãã«ã掻çšããŠããåµäžèšã®æä»£ãããæéããã¡ã¢ãã«ã®è€éæ§ã¯ãŸãåæã®åºç€ã§ã¯ãã¡ã³ãã³ã¹ããã€ããªã£ãŠããã®ã§ V2 ã®åºç€ãäœæãåºæ¬æ¹éãšããŠã¯å 補éçºã¯ãã㊠OSS ãç©æ¥µæ¡çšã®æ¹éã«ã
V2 ã§ã¯ã
- ETL: Dataflow
- Prototyping: Jupyter Notebooks
- Training: Vertex AI
ãš GCP ãµãŒãã¹ããã«æŽ»çšããæ§æ
é¢çœãã£ãç¹ãšããŠã¯ãServiing ã§ã¯å 補ã®ãµãŒãã³ã°ã·ã¹ãã ãã¢ãã«ã®è©äŸ¡é¢ã§ã¯ãã¯ãå§åçã«äŸ¿å©ãªã®ã§ OSS æ¡çšååãæ€åããŠããã ãã¯å 補ã·ã¹ãã ãæ¡çšãšããæè»ãªæææ±ºå®ãè峿·±ãã£ãã
V2 ãžã®å·æ°ã«ãã£ãŠãçæ³ãããããã¯ã·ã§ã³ãªãªãŒã¹ãŸã§ã®æéã 5 å²ã»ã©åæžãããŠããããããããã¯å€§ããªææã
Google Research: Themes from 2021 and Beyond
Jeff Dean ãå ¬éãã Google Research ãã 2021 幎ããã®ããŒããšããããäœãè¡ããã®è§£èª¬ã
å人çã«æ°ã«ãªã£ãã®ã¯ãæ±ç𿩿¢°åŠç¿ã¢ãã«ã®é©çšçµæãMUMã§ã質çå¿çã¢ãã«ãæ€çŽ¢ã«é©çšãããªã©éåžžã«éå¿çãªåãçµã¿ãããããã·ã¹ãã çãªéçšã粟床ã®ä¿å®ã»ã¡ã³ãã³ã¹ãªã©è£åŽã§ã¯ã©ãåããŠããã®ãéåžžã«æ°ã«ãªãã
A decade in deep learning, and what’s next
20 幎åã« Google ã¯æ©æ¢°åŠç¿å©çšãéå§ã10 幎åã«ã¯æ·±å±€åŠç¿ã®é©çšãå§ãããVP, Responsible AI and Human-Centered Technology ã® Marian ãããšçãããåç¥ã® Jeff Dean ãæ·±å±€åŠç¿ã§ã® 10 幎ã§äœãèµ·ãããããããŠããããäœãèµ·ãããã解説ã
ç ç©¶ããçŸå®äžçã§ã®çšŒåã«ç§»è¡ããã㊠Google ã®ãµãŒãã¹ãžã®æ©æ¢°åŠç¿é©çšäºäŸã玹ä»ã幎éã« 1000 æ¬è¿ãè«æãå ¬éããŠãGoogle AI 㯠10 幎éã§ã¯ 6500 以äžã®ç¹èš±ãååŸããŠãããããã
ãŸããæ©æ¢°åŠç¿ã®æŽ»çšããœãŒã·ã£ã«ããããŒã«ç¹ããããããã«æè³ããŠãããããã
The 10 most read research papers of 2021
Amazon Science ã 2021 幎ã«åºçããè«æã®ãã¹ã 10 ã玹ä»ã ãSeasonal relevance in e-commerce searchãã§ã¯ã39%ã®ã¯ãšãªãã·ãŒãºããªãã£ã«ãã£ãŠ relevance ãäŸåããŠãããAB ãã¹ãã§ã¯ 2.2%è³Œå ¥çãåäžããããããæ³å以äžã«ã€ã³ãã¯ããåããŠã³ã£ããã
ãReducing Amazonâs packaging waste using multimodal deep learningãã§ã¯ã深局åŠç¿ã䜿ã£ãŠã2015 幎ããæ¯èŒãããš 36%ã®ããã±ãŒãžãåæžã㊠100 äžãã³ã®ããã±ãŒãžã®äœåãªæåºãæããŠãããšã®ããšããœãŒã·ã£ã«ããããŒãªåé¡ã®å žåã§ãããããæ©æ¢°åŠç¿ã®ã¹ã±ãŒã«ã¡ãªãããæŽ»ãããå žåçãªè¯åã ãª~ãšæå¿ããã
S3 ã®ã·ã¹ãã è«æãšããé¢çœããã ãããã£ãšèªãã§ã çè§£ãé£ããã£ãã
The top Amazon Science blog posts of 2021
Amazon Science ã 2021 å¹Žã«æžãã Blog èšäºã® Top10 ã®ç޹ä»ãAmazon ã®ãããšããããåéã«æ©æ¢°åŠç¿ã®å®çšåãå³ãããšããŠããå§¿å¢ãäŒãã£ãŠããã®ã§ãAmazon Science ã®èšäºã¯éåžžã«é¢çœãã Learning to Rank ã䜿ã£ãŠãè·ç©ã床ã®çé¢ã«çœ®ããã¹ãããäºæž¬ãããšããèšäºãé¢çœããã ã£ãã
Metaâs AI team working on harmful Facebook posts moved to AR / VR unit
Meta (æ§å Facebook) ã®æ©æ¢°åŠç¿ã«ããé忀ç¥ããŒã ã¯ããªãã®æè³ããããŠããŠãKDD2020 ã® keynote speaker ã§ã¯ Alon ãããPreserving Integrity in Online Social Mediaãšããé¡ç®ã§ãç£èŠæ¥åãæ©æ¢°åŠç¿ã«ããå¥å šãªç°å¢ãä¿ã€ããã«äœãè¡ã£ãŠããããè¬æŒããŠããããããã Meta ã§ã®å¥å šåã«ãªãœãŒã¹ãéã£ãŠãããšãã Meta ãžã®å ¥ã蟌ã¿å ·åãéåžžã«ããããã¥ãŒã¹ã
èªåã®æåã®æ©æ¢°åŠç¿ã¿ã¹ã¯ãé忀ç¥ãšå¥å šåã ã£ãã®ã§ããã®åéã«ã¯éåžžã«èå³ãããã
ããã®èšäºçµç±ã§ç¥ã£ãè«æã§é¢çœãããªãã®ã¯ Blog ã§ãã£ãšè§£èª¬èšäºãæžããŠããã¥ãŒã¹ã¬ã¿ãŒã§æ·±å ãããŠç޹ä»ãããããªãšæããŸãã
ææ³
Twitter ã§ #searchengineeringnewsletter ãã€ããŠã€ã¶ãããŠããã ããã Google ãã©ãŒã ã§ã®ææ³æçš¿ããåŸ ã¡ããŠãããŸãã
newsletter ã®ã¿ã°ãä»äžããèšäºã® RSS1 ãäœæããŠããŸãã ææã¡ã® RSS ãªãŒããŒã«ç»é²ããŠããã ããã° newsletter ã®æŽæ°ãææ¡ããããšãã§ããŸãã
newsletter RSS: https://shunyaueta.com//tags/newsletter/index.xml ↩︎
See Also
- Search Engineering Newsletter vol.00
- Amazonæ€çŽ¢ã©ã³ãã³ã°ã«åãçµã楜ãã at MLconf SF 2016
- Amazonãeã³ããŒã¹æ€çŽ¢ã Lucene ã«ãããã©ãã¹ã±ãŒã«ãããŠããã at Berlin Buzzwords 2019
- ã¯ãšãªåé¡(Query Classification) ã«ã€ããŠç€Ÿå ã®å匷äŒã§è©±ããŠãã
- eã³ããŒã¹ã®æ€çŽ¢ãšæšèŠã«ã€ããŠã®ãµãŒãã€è«æã§ãã 'Challenges and research opportunities in eCommerce search and recommendations' ã瀟å å匷äŒã§çºè¡šãã